(54) Title: GALACTURONOSYLTRANSFERASES, NUCLEIC ACIDS ENCODING SAME AND USES THEREFOR

(57) Abstract: The invention provides an isolated nucleic acid molecule encoding a polypeptide having galacturonosyltransferase
(GalAT) activity. The GALAT1 disclosed represents the first pectin biosynthetic glycosyltransferase gene isolated from plants.
The invention further provides 14 GALAT and 10 GALAT-like gene superfamily members. The identification of the GALAT gene
superfamily offers new opportunities to modulate pectin synthesis in vivo and in vitro by modulating the GALAT gene. For example,
transgenic plants that produce modified pectins can be generated by altering the GALAT gene. Since modified pectins are predicted to
affect plant growth, development, and plant defense responses, the transgenic plants are expected to have improved agricultural value.
The modified pectins isolated from such transgenic plants are useful as gelling and stabilizing agents in the food, nutraceutical, and
pharmaceutical industries. The expressed proteins, and variants thereof, of the GALAT superfamily are useful to produce in vitro
modified pectins of commercial value.

GALACTURONOSYLTRANSFERASES, NUCLEIC ACIDS ENCODING SAME AND USES THEREFOR

CROSS REFERENCE TO RELATED APPLICATIONS

The present application claims benefit of United States Provisional
Patent Application No. 60/445,539 filed February 6, 2003, which is incorporated in
its entirety herein by reference to the extent not inconsistent herewith.

BACKGROUND 

This invention relates to plant physiology, growth, development, defense
and, in particular, to plant genes, termed galacturonosyltransferases (GALATs),
nucleic acids encoding same and the uses therefor.

Pectins are the most complex polysaccharides in the plant cell wall. They
comprise 30-40% of the primary wall of dicots and non-graminaceous monocots,
and ~10% of the primary wall in the grass family. Pectins are a family of
polysaccharides 6,8,27 that include homogalacturonan (HGA) (Fig. 1),
rhamnogalacturonan I (RG-I) (Fig. 2) and rhamnogalacturonan II (RG-II) (Fig. 3) as
well as xylogalacturonans (XGA) 32,34,38 and apiogalacturonans 6,37. While the
specific structure of each of these polysaccharides differs as shown in Figs. 1-3,
they are grouped into one family since they appear to be linked to each other in the
wall and they each contain α-D-galacturonic acid connected by a 1,4-linkage.

HGA is the most abundant pectic polysaccharide, accounting for ~55%-70%
of pectin 39. HGA is a linear homopolymer of α-1,4-linked D-galactosyluronic acid
that is partially methylesterified at the C6 carboxyl group and may be partially
acetylated at O-2 and/or O-3 8 (Fig. 1). Some plants also contain HGA that is
substituted at the 2 or 3 position by D-apiofuranose, the so-called
apiogalacturonans (AGA) 36,37, and/or HGA that is substituted at the 3 position with
D-xylose 32-35, so-called xylogalacturonan (XGA). RG-II is a complex polysaccharide
that accounts for approximately 10-11% of pectin 8,39. RG-II has an HGA backbone
with four structurally complex side chains attached to C-2 and/or C-3 of the GalA 8,27
(Fig. 3). Rhamnogalacturonan I (RG-I) accounts for 20-35% of pectin 39 (Fig. 2).
RG-I is a family of polysaccharides with an alternating [→4)-α-D-GalA-(1→2)-α-L-Rha-(1→]
backbone in which roughly 20-80% of the rhamnoses are substituted by
arabinan, galactan, or arabinogalactan side branches 6,8,30.

Pectins are believed to have multiple roles during plant growth,
development, and in plant defense responses. For example, pectic
polysaccharides play essential roles in cell wall structure 43, cell adhesion 44 and cell
signaling 45,46. Pectins also appear to mediate pollen tube growth 47 and to have
roles during seed hydration 48,49, leaf abscission 50, water movement 51, and fruit
development 47,8. Oligosaccharides cleaved from pectin also serve as signals to
induce plant defense responses 52,53. Studies of mutant plants with altered wall
pectin reveal that modifications of pectin structure lead to dwarfed plants 43, brittle
leaves 44, reduced numbers of side shoots and flowers 54, malformed stomata 44 and
reduced cell adhesion 55.

Although pectins appear to have multiple roles in plants, in no case has their
specific mechanism of action been determined. One way to directly test the
biological roles of pectins, and to study their mechanisms of action, is to produce
plants with specific alterations in pectin structure. This can be done by knocking
out genes that encode the pectin biosynthetic enzymes. Such enzymes include the
nucleotide-sugar biosynthetic enzymes and the glycosyltransferases that
synthesize the pectic polysaccharides. Each glycosyltransferase is expected to
transfer a unique glycosyl residue in a specific linkage onto a specific
polymeric/oligomeric acceptor. To date, only five 56-59,136 of the more than 200
predicted wall biosynthetic glycosyltransferases have been functionally identified at
the gene level (i.e., enzyme activity of the gene product proven), and none of these
have been shown to encode pectin biosynthetic enzymes.

WO 2004/072250
PCT/US2004/003545

Based on the known structure of pectin, at least 58 distinct glycosyl-, methyl-
and acetyl-transferases are believed to be required to synthesize the family of
polymers known as pectin. As shown in the review by Mohnen, D. (2002) "Pectins
and their Manipulation", G.B. Seymour et al., Blackwell Publishing and CRC Press,
Oxford, England, pp. 52-98, and Table I below, a minimum of 4-9
galacturonosyltransferases are predicted to be required for the synthesis of HGA,
RG-I, RG-II and possibly for the synthesis of the modified forms of HGA known as
XGA and AGA. The present invention relates to the identification of the first gene,
GALAT1, encoding a galacturonosyltransferase and related genes thereto. The
studies disclosed hereinbelow led the inventors to conclude that the gene GALAT1
encodes the enzyme known as UDP-GalA:Homogalacturonan α-1,4-
Galacturonosyltransferase.



Table I. List of galacturonosyltransferase activities predicted to be required for pectin
biosynthesis 9

Type of   Working    Parent      Enzyme acceptor              Enzyme        Ref for
GalAT     number¹    polymer²    substrate³                   activity      structure
-------   --------   ---------   --------------------------   -----------   ---------
D-GalAT   1          HGA         *GalAα1→4GalA                α1,4-GalAT    27
D-GalAT   2          RG-I        L-Rhaα1→4GalA                α1,2-GalAT    27-29
D-GalAT   3          RG-II       L-Rhaβ1→3Apif                α1,2-GalAT    30,31
D-GalAT   4          RG-II       L-Rhaβ1→3Apif                β1,3-GalAT    30,31
D-GalAT   5?⁴        RG-I/HGA    GalAα1→2LRha                 α1,4-GalAT
D-GalAT   6?         RG-II/HGA   GalAα1→4GalA                 α1,4-GalAT
D-GalAT   7?         XGA         GalAα1→4(Xylβ1→3)GalA⁵       α1,4-GalAT    32-35
D-GalAT   8?         AGA         GalAα1→4(Apifβ1→2)GalA       α1,4-GalAT    36,37
D-GalAT   9?         AGA         GalAα1→4(Apifβ1→3)GalA       α1,4-GalAT    36,37

and on the assumption that HGA is synthesized first, followed by RG-I and RG-II. The
numbers were given 9 to facilitate a comparison of the enzymes, but final numbering will
likely correspond to the order in which the genes are identified.

2 HGA: Homogalacturonan; RG-I: Rhamnogalacturonan I; RG-II: Rhamnogalacturonan II;
XGA: Xylogalacturonan; AGA: Apiogalacturonan.

3 All sugars are D sugars and have pyranose rings unless otherwise indicated.
Glycosyltransferases add to the glycosyl residue on the left* of the indicated acceptor.

4 The ? means the designated GalAT may be required if a different GalAT in the list does
not perform the designated function.

5 Glycosyl residue in the parenthesis is branched off the first GalA.

Over the years, membrane-bound α1,4-galacturonosyltransferase (GalAT)
activity has been identified and partially characterized in mung bean 10,11, tomato 12,
turnip 12, sycamore 13, tobacco suspension 2, radish roots 5, enriched Golgi from pea 7,
Azuki bean 14, Petunia 15, and Arabidopsis (see Table II). The pea GalAT was found
to be localized to the Golgi 7 with its catalytic site facing the lumenal side of the
Golgi 7. These results provided the first direct enzymatic evidence that the
synthesis of HGA occurs in the Golgi. In in vitro reactions, GalAT adds [14C]GalA
from UDP-[14C]GalA 1,60 onto endogenous acceptors in microsomal membrane
preparations to produce radiolabeled products of large molecular mass (i.e., ~105
kDa in tobacco microsomal membranes 2 and ≥ 500 kDa in pea Golgi 7). The cleavage
of up to 89% of the radiolabeled product into GalA, digalacturonic acid (diGalA) and
trigalacturonic acid (triGalA) following exhaustive hydrolysis with a purified
endopolygalacturonase confirmed that the product synthesized by tobacco GalAT
was largely HGA. Thus, the crude enzyme catalyzes the reaction in vitro:
UDP-GalA + HGA(n) → HGA(n+1) + UDP. The product produced in vitro in tobacco
microsomes was ~50% esterified 2 while the product produced in pea Golgi did not
appear to be heavily esterified 7. These results suggest that the degree of methyl
esterification of newly synthesized HGA may be species specific and that
methylesterification occurs after the synthesis of at least a short stretch of HGA.
GalAT in detergent-permeabilized microsomes from azuki bean seedlings added
[14C]GalA from UDP-[14C]GalA onto acid-soluble polygalacturonate (PGA)
exogenous acceptors 14. Treatment of the radiolabeled product with a purified
fungal endopolygalacturonase yielded GalA and diGalA, confirming that the activity
identified was a GalAT comparable to that studied in tobacco and pea. The azuki
bean enzyme had a surprisingly high specific activity of 1300-2000 pmol mg⁻¹ min⁻¹,
especially considering the large amount (3.1-4.1 nmol mg⁻¹ min⁻¹) of
polygalacturonase activity that was also present in the microsomal preparations.
As with the product made by tobacco, no evidence for the processive transfer of
galactosyluronic acid residues onto the acceptor was obtained (see below).

Table II. Comparison of apparent catalytic constants and pH optimum of HGA-α1,4-
galacturonosyltransferases¹

Enzyme²        Plant source   Apparent Km for   pH optimum   Vmax                 Ref
                              UDP-GalA (µM)                  (pmol mg⁻¹ min⁻¹)
------------   ------------   ---------------   ----------   ------------------   ---
GalAT          mung bean      1.7               6.0          ~4700                10
GalAT          mung bean      n.d.              n.d.         n.d.                 11
GalAT          pea            n.d.⁵             6.0          n.d.                 12
GalAT          pea            n.d.              n.d.         n.d.                 7
GalAT          sycamore       770               n.d.         ?                    13
GalAT          tobacco        8.9               7.8          150                  2
GalAT (sol)³   tobacco        37                6.3-7.8      290                  3
GalAT (sol)³   Petunia        170               7.0          480                  15
GalAT (per)⁴   Azuki bean     140               6.8-7.8      2700                 14

1 Adapted from ref 6.

2 Unless indicated, all enzymes are measured in particulate preparations.

3 (sol): detergent-solubilized enzyme.

4 (per): detergent-permeabilized enzyme.

5 n.d.: not determined.

GalAT can be solubilized from membranes with detergent 3. Solubilized
GalAT adds GalA onto the non-reducing end 4 of exogenous HGA
(oligogalacturonide; OGA) acceptors of a degree of polymerization of at least ten 2.
The bulk of the HGA elongated in vitro by solubilized GalAT from tobacco
membranes 3, or detergent-permeabilized Golgi from pea 7, at roughly equimolar
UDP-GalA:acceptor concentrations is elongated by a single GalA residue. These
results suggest that solubilized GalAT in vitro acts nonprocessively (i.e.,
distributively). The apparent lack of in vitro processivity of GalAT was recently
confirmed by Akita et al., who, using pyridylaminated oligogalacturonates as
substrates and high concentrations of UDP-GalA, showed that although OGAs can
be elongated in a "successive" fashion with up to 10 GalA residues by solubilized
enzyme from petunia pollen 15, the kinetics of this response suggest a distributive
mode of action. We have two working hypotheses as to why GalAT in vitro does
not appear to act processively. One hypothesis is that the solubilized enzyme or
the enzyme in particulate preparations does not have the required factors, or is not
present in the required complex, to act processively. An alternative hypothesis is
that for a Golgi-localized enzyme that synthesizes a complex polymer in a confined
internal cellular compartment, such as GalAT, with sufficiently high concentrations
of substrate, it would not necessarily be advantageous for the enzyme to act
processively. In fact, the reaction velocity could be hindered under such conditions
if the enzyme were processive 65.
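By way of illustration only, the distinction between distributive and processive elongation can be sketched with a toy simulation. The model below is an assumption made for clarity and is not the assay used in the cited studies: each transfer of GalA either lands on a randomly chosen acceptor chain (distributive) or the enzyme remains on one chain for a fixed, hypothetical run of transfers (processive). The function names, the chain counts and the run length of 10 are all hypothetical.

```python
import random
from collections import Counter


def distributive(n_acceptors, n_donors, rng):
    """Distributive mode: every transfer lands on a randomly chosen acceptor chain."""
    added = [0] * n_acceptors
    for _ in range(n_donors):
        added[rng.randrange(n_acceptors)] += 1
    return added


def processive(n_acceptors, n_donors, run_length, rng):
    """Processive mode: the enzyme stays on one chain for run_length transfers."""
    added = [0] * n_acceptors
    remaining = n_donors
    while remaining > 0:
        chain = rng.randrange(n_acceptors)
        step = min(run_length, remaining)
        added[chain] += step
        remaining -= step
    return added


def elongated_histogram(added):
    """Count elongated chains by number of residues added (unextended chains ignored)."""
    return Counter(a for a in added if a > 0)


# Equimolar donor:acceptor ratio, as in the in vitro experiments described above.
rng = random.Random(0)  # fixed seed so the sketch is reproducible
dist = elongated_histogram(distributive(10_000, 10_000, rng))
proc = elongated_histogram(processive(10_000, 10_000, 10, rng))

if __name__ == "__main__":
    print("distributive:", sorted(dist.items()))
    print("processive:  ", sorted(proc.items()))
```

Under these assumptions the most common elongated product in the distributive case carries a single added residue, whereas the processive case yields fewer elongated chains, each extended by at least the run length.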

The apparent kinetic constants and pH optimum for the characterized
GalATs are shown in Table II. We have performed additional kinetic studies in
tobacco and radish that suggest that solubilized and membrane bound GalAT may
have unusual apparent biphasic kinetics. We tested Vo for radish GalAT at 2 µM to
80 mM UDP-GalA and obtained a biphasic curve (Fig. 4), suggesting that the
kinetics of GalAT, at least in the membrane and soluble fractions, are complex.
Comparable results were also obtained for the solubilized radish and tobacco
enzyme. The initial Vo vs [UDP-GalA] curve was hyperbolic and appeared to reach
an initial maximum Vo of ~300 pmol mg⁻¹ min⁻¹ at ~1 mM UDP-GalA, confirming
previous results reported for tobacco 2,3. However, at ≥ 2 mM UDP-GalA there was
a second hyperbolic increase in GalAT activity that reached a maximum of ~2-4
nmol min⁻¹ mg⁻¹ with ~20 mM UDP-GalA. In crude enzyme preparations it was not
possible to determine the basis for the unusual kinetics. One possibility is that two
GalATs were present, one with a low Km and one with a high Km. Another
possibility is that UDP-GalA is both a substrate and an allosteric regulator of GalAT.
Alternatively, a more "trivial" explanation is that at low substrate concentrations the
kinetics of GalAT were affected by a catabolic enzyme (e.g., a phosphodiesterase)
in the enzyme preparation.
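For illustration, the first possibility noted above (two GalATs, one with a low Km and one with a high Km) can be expressed as the sum of two hyperbolic rate terms. The sketch below is a minimal model under stated assumptions: the Km values and the high-Km Vmax are hypothetical placeholders chosen only to show the shape of such a curve, and they are not fitted to the data of Fig. 4.

```python
def michaelis_menten(s_mM, vmax, km_mM):
    """Hyperbolic rate for a single enzyme; vmax in pmol mg^-1 min^-1, s and Km in mM."""
    return vmax * s_mM / (km_mM + s_mM)


# Illustrative parameters only (assumed, not measured or fitted).
LOW_KM_ENZYME = {"vmax": 300.0, "km_mM": 0.01}    # saturates near 1 mM UDP-GalA
HIGH_KM_ENZYME = {"vmax": 3000.0, "km_mM": 5.0}   # contributes mainly above ~2 mM


def two_enzyme_rate(s_mM):
    """Total initial velocity if two independent GalATs act on the same substrate."""
    return (michaelis_menten(s_mM, **LOW_KM_ENZYME)
            + michaelis_menten(s_mM, **HIGH_KM_ENZYME))


if __name__ == "__main__":
    # Span the tested range of 2 uM to 80 mM UDP-GalA.
    for s in (0.002, 0.02, 0.2, 1.0, 2.0, 20.0, 80.0):
        print(f"{s:8.3f} mM UDP-GalA -> {two_enzyme_rate(s):8.1f} pmol mg^-1 min^-1")
```

A sum of two hyperbolas cannot by itself distinguish this hypothesis from allosteric regulation or from interference by a catabolic enzyme; it only shows that two enzymes with well-separated Km values would give a second rise in activity at high substrate concentration.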

As a first step towards elucidating the role of galacturonosyltransferase
(GALAT) in pectin synthesis, the inventors herein identified an Arabidopsis gene
encoding alpha-1,4-galacturonosyltransferase 1 (GALAT1). Database
searches using the amino acid sequence of GALAT1 identified fourteen
additional GALAT family members and ten GALAT-like genes. The identification of
these genes and the availability of the sequence information allow the
characterization of the enzyme, the use of these genes to produce mutated
enzymes in vivo and in vitro, and transgenic plants producing modified pectins, and
studies of the role of a specific GalAT in pectin synthesis. The advantages of the
present invention will become apparent in the following description.

SUMMARY OF THE INVENTION

The present invention provides an isolated nucleic acid molecule
encoding a polypeptide having galacturonosyltransferase (GalAT) activity. The
GALAT1 disclosed herein represents the first functionally proven pectin
biosynthetic glycosyltransferase gene isolated from plants. Also provided are
14 additional GALAT gene family members and 10 GALAT-like genes predicted to
have galacturonosyltransferase activity. The identification and availability of the
nucleic acid molecules as members of the GALAT gene superfamily offer new
opportunities to modulate pectin synthesis in vivo and in vitro by modulating the
GALAT gene using various art-known recombinant DNA technologies. For example,
transgenic plants that produce modified pectins of desired properties can be
generated by manipulating the gene encoding the GALAT protein, i.e., mutating the
gene including coding and non-coding sequences, silencing the gene by an RNAi
approach, or by administering a composition that would affect the GalAT activity in
the plant. Since modified pectins are predicted to affect plant growth, development,
and plant defense responses, the transgenic plants thus modified are expected to
have improved agricultural value. The modified pectins can be isolated from such
transgenic plants according to art-known methods and serve as gelling and
stabilizing agents of improved properties in the food, nutraceutical, and
pharmaceutical industries.

The inventors herein identified the first gene, GALAT1, which encodes a
pectin biosynthetic enzyme by employing a partial purification-tandem mass
spectrometry approach combined with a search of the Arabidopsis gene/protein
database. Two genes, designated JS33 and JS36 herein, were identified as
present only in the GalAT-containing fractions. As demonstrated hereinbelow, the
expressed protein from the nucleic acid sequence of JS36 indeed exhibits the
predicted GalAT enzymatic activity.

A standard protein BLAST and a PSI-BLAST of the NCBI protein database using
the GALAT1 (JS36) amino acid sequence revealed that GALAT1 is a member of a
15-member GALAT gene family in Arabidopsis. The genes selected for this family
have at least 30% amino acid identity and at least 50% amino acid similarity based
on the PSI-BLAST. The database search using the GALAT1 sequence further
identified 10 GALAT-like genes as shown in Table IV. The genes disclosed herein,
fifteen GALAT genes and ten GALAT-like genes, thus represent the GALAT gene
superfamily members.

The availability of the amino acid and nucleotide sequences of the GALAT 
gene superfamily members makes it possible to identify other GALAT homologs in 
other plants. The nucleotide and amino acid sequences of the GALAT genes can 
also be used to generate specific antibodies for the protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows the trimeric region of homogalacturonan (HGA). HGA is a
linear homopolymer of alpha-1,4-linked galacturonic acid that may be
methylesterified at C6 and acetylated at O-2 or O-3. Substituted galacturonans,
such as RG-II and apiogalacturonan, have an HGA backbone.

Fig. 2 shows the representative structure of rhamnogalacturonan I (RG-I).
RG-I has an alternating [→4)-alpha-D-GalpA-(1→2)-alpha-L-Rhap-(1→] backbone
in which roughly 20-80% of the rhamnoses are substituted by arabinans, galactans,
or arabinogalactans.

Fig. 3 shows the representative structure for rhamnogalacturonan II (RG-II). 
RG-II has a backbone of 1,4-linked alpha-D-GalpA residues. GalA residues are 
also present in RG-II side chain A. 

Fig. 4 illustrates the GalAT kinetics in radish microsomal membranes.
Radish microsomal membranes (60-80 µg protein) were incubated with 70 µg of
OGA (DP 7-23) and the indicated concentrations of UDP-GalA. Each reaction
contained a small concentration of UDP-[14C]GalA (2-3.6 µM) with larger amounts
of nonradioactive UDP-GalA. The precipitated reaction products were measured by
liquid scintillation counting. The data are the averages of duplicate samples from
three separate experiments. The Y axis is specific activity (pmol min⁻¹ mg⁻¹).

Fig. 5 shows the outline of the strategy to identify the gene for GalAT. The
sequenced Arabidopsis genome allowed the use of a function-based partial
purification-mass spectrometry approach to identify the putative
galacturonosyltransferase genes. The sample analyzed in each lane is as follows:
lane 1: homogenate, lane 2: total membranes, lane 3: solubilized proteins, lane 4:
initial anion exchange purification step.

Figs. 6A and 6B show the results of RT-PCR experiments; 6A shows the
results of JS33, JS36, and JS36L (a GalAT family gene with 63% identity to JS36)
using Arabidopsis flower (F), root (R), stem (S), and leaf (L) RNA, and 6B shows the
RT-PCR control using the Arabidopsis actin gene in the same tissues.

Fig. 7 is a schematic representation of the transmembrane spanning region
and the conserved amino acids in the Arabidopsis thaliana GALAT gene family.
The relative position of the strictly conserved residues among the members of the
proposed GALAT family is numbered as for JS36 (i.e., GALAT1). The striped
region from residues 22-44 represents the predicted transmembrane region.

Fig. 8 demonstrates that recombinant JS36 (At3g61130) has
galacturonosyltransferase (GalAT) activity. Human embryonic kidney cells
(HEK293) were transiently transfected with the pEAK vector alone, or with pEAK
vector containing the truncated versions of JS33 or JS36. Total media (1); protein
immunoabsorbed from the medium using anti-HA epitope:Protein A Sepharose (2);
and protein immunoabsorbed from the medium using anti-HA epitope:Protein G
Sepharose (3) were tested for GalAT activity. Data are the average [14C]GalA
incorporated into product from duplicate reactions from three separate experiments.

Fig. 9 shows the relationship of the Arabidopsis GalAT superfamily including
the GalAT family and the GalAT-like family. The Neighbor-Joining Tree is based on
a sequence alignment generated by ClustalX.

DETAILED DESCRIPTION OF THE INVENTION 

In general the terms and phrases used herein have their art-recognized 
meaning, which can be found by reference to standard texts, journal references 
and contexts known to those skilled in the art. The following definitions are 
provided to clarify their specific use in the context of the invention. 

In the present application, the designation "GALAT" is used to denote the
gene for galacturonosyltransferase, "GALAT" is used to denote the protein encoded
by the gene, and "GalAT" is used to indicate galacturonosyltransferase enzyme
activity.

The term, "polypeptide", is used herein interchangeably with "protein" to 
indicate a product encoded by a given nucleic acid. 

The terms "identity" and "similarity", as used herein, are intended to indicate
the degree of homology between two or more nucleic acid or amino acid
sequences. The degree of identity or similarity can be determined using any one of
the computer programs that are well known in the art. The National Center for
Biotechnology Information (NCBI) website on the internet provides detailed
description and references necessary for this subject. Also see Karlin and Altschul
(1993) Proc. Natl. Acad. Sci. USA 90:5873-5877; Altschul et al. (1997) Nucl. Acids
Res. 25:3389-3402. In the present application, the percent amino acid identity and
similarity among the GALAT gene family and GALAT-like gene family members
were determined using the NCBI Pairwise BLAST and the BLOSUM62 matrix with the
GALAT1 (JS36) amino acid sequence as the query.
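As a non-limiting illustration of how percent identity and percent similarity are tallied over an alignment, the following sketch counts identical and similar columns in two pre-aligned sequences and applies the family cutoffs stated in this application (at least 30% identity and at least 50% similarity). Several points are assumptions made for brevity: the sequences are hypothetical toy fragments, the residue groups are a simplified stand-in for the positive-scoring pairs of the BLOSUM62 matrix, and percentages are taken over all alignment columns including gaps. It does not reproduce NCBI BLAST.

```python
# Simplified similarity groups: a hypothetical stand-in for BLOSUM62 positive scores.
SIMILAR_GROUPS = ["ILVM", "FYW", "KRH", "DE", "NQ", "ST", "AG"]
GROUP_OF = {aa: i for i, group in enumerate(SIMILAR_GROUPS) for aa in group}


def percent_identity_similarity(aligned_a, aligned_b):
    """Return (percent identity, percent similarity) over all alignment columns."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    identical = similar = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # gap columns count toward length but never match
        if x == y:
            identical += 1
            similar += 1
        elif x in GROUP_OF and GROUP_OF[x] == GROUP_OF.get(y):
            similar += 1
    length = len(aligned_a)
    return 100.0 * identical / length, 100.0 * similar / length


def meets_family_cutoff(identity, similarity, min_identity=30.0, min_similarity=50.0):
    """Apply the identity and similarity thresholds used to select family members."""
    return identity >= min_identity and similarity >= min_similarity


# Hypothetical aligned fragments, for demonstration only.
identity, similarity = percent_identity_similarity("MKVLA-DE", "MRVIAGDQ")
```

In practice the alignment and scoring would be taken from the BLAST output itself; the sketch only makes explicit what the two percentages measure.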

A "corresponding" nucleic acid or amino acid or sequence of either, as used 
herein, is one present at a site in a GALAT molecule or fragment thereof that has 
the same structure and/or function at a site in another GALAT molecule, although 
the nucleic acid or amino acid position may not be identical. 






The term "gene" is used herein in the broadest context and includes a
classical genomic gene consisting of transcriptional and/or translational regulatory
sequences and/or a coding region and/or nontranslated sequences (i.e., introns, 5'-
and 3'-untranslated sequences), or mRNA or cDNA corresponding to the coding
regions (i.e., exons) and 5'- and 3'-untranslated sequences.

The meaning of a "homolog" as used herein is intended to indicate any gene
or gene product which has a structural or functional similarity to the gene or gene
product in point. For example, a new homolog of a given GALAT gene can be
identified either by a database search using the amino acid or nucleic acid
sequences of a given GALAT gene or by screening appropriate cDNA or genomic
libraries according to art-known methods.

An "expression vector" as used herein generally refers to a nucleic acid
molecule which is capable of expressing a protein or a nucleic acid molecule of
interest in a host cell. Typically, such vectors comprise a promoter sequence (e.g.,
TATA box, CAAT box, enhancer etc.) fused to a heterologous sequence (i.e., a
nucleic acid of interest), sense or antisense strand, followed by a transcriptional
termination sequence, a selectable marker, and other regulatory sequences
necessary for transcription and translation of the nucleic acid of interest. A plant
expressible promoter is a promoter comprising all the necessary so-called
regulatory sequences for transcription and translation of a gene of interest in plants.
The linkage between the heterologous sequence and the regulatory sequences
(e.g., promoter) is "in operable linkage" when a desired product can be made from
the heterologous sequence under the control of the given regulatory sequences.
An "expression vector" is often used interchangeably with an "expression construct"
in this sense.

The term "transgenic plant" as used herein refers to a plant that has been
transformed to contain a heterologous nucleic acid, i.e., a plant expression vector
or construct for a desired phenotype. The transgenic plant is intended to include
whole plant, plant parts (stems, roots, leaves etc.) or organs, plant cells, seeds,
and progeny of same. The transgenic plant having modified pectin of the present
application is one that has been generated by manipulating the gene encoding the
GALAT protein. This can be achieved, for example, by mutating the gene, silencing
the gene by an RNAi approach, or by knocking out the gene. The transgenic plants of
the invention are predicted to have properties such as changes in organ and plant
size, water transport properties, ease of removal of leaves and fruits via effects on
abscission, pollen development and release, fruit ripening, root mucilage production,
root growth, root cell cap production and separation, stem elongation, shoot growth,
flower formation, tuber yield, defense responses against pathogens, and stomata
opening 8. Thus, the invention provides new means of improving plants of
agricultural value. The "modified" pectins are those that exhibit structures and
properties (e.g., gelling and stabilizing) different from those of the pectins naturally
present in plants. Since galacturonic acid is a component of each of the pectic
polysaccharides (i.e., HGA, RG-I, RG-II and XGA), a modification of the GalATs that
add the specific GalAs into the specific polysaccharides is expected to modify the
unique polymers. Such changes in pectin structure would affect multiple pectin
properties including ionic interactions between HGA regions, gelation properties,
dimer formation of RG-II molecules, length and degree of branching of RG-I, and
side branch structure of RG-II. Such modifications are predicted to affect not only
the biological function of pectin in plants, and the chemical and biological properties
of pectin extracted and used by the food and cosmetic industries, but also
properties that affect the use of pectin as a biopolymer for industrial processes, as
a drug delivery polymer, and as pectins of medicinal and nutraceutical properties in
human and animal health.

The term "mutation" as used herein refers to a modification of the natural
nucleotide sequence of a nucleic acid molecule made by deleting, substituting, or
adding a nucleotide(s) in such a way that the protein encoded by the modified
nucleic acid is altered structurally and functionally. The mutation in this sense
includes those modifications of a given gene outside of the coding region.

The present invention provides polypeptides and nucleic acids encoding the
polypeptides belonging to a family of the pectin biosynthetic enzyme,
galacturonosyltransferase (GALAT). Pectins have been implicated in a broad range
of plant growth phenomena including pollen tube growth 47, seed hydration 48,49, leaf
abscission 50, water movement 128, and fruit development 8. In addition, pectic
oligosaccharides serve as signals 45 during plant development 45 and induce plant
defense responses 52,53. Mutant studies have shown that altered pectin structure
leads to dwarfed plants 43, brittle leaves 44, reduced numbers of side shoots and
flowers 129, and plants with reduced cell-cell adhesion 130,55. Therefore, the present
invention provides the molecular and biochemical tools needed to identify additional
glycosyltransferases involved in branching of the backbones, and would allow the
generation of plants with altered pectin structure. While the 25 genes disclosed
herein represent only ~0.1% of the ~28,000 genes in Arabidopsis, they are some of
the most difficult genes to identify and characterize because of a lack of
commercially available acceptor substrates and activated glycosyl donor
substrates.

The GALAT1 gene has high sequence similarity to proteins expressed in
other plants; thus, using the sequences disclosed herein, a person of ordinary skill
in the art can identify other pectin biosynthetic genes (i.e., homologs) in other plant
species, including agriculturally important plants. Since pectin of very similar
structure is present in the walls of all flowering plants and gymnosperms, the
identification of functional pectin biosynthetic genes will greatly facilitate the
engineering of plants with modified pectin and with altered growth characteristics,
some of which are expected to yield plants of increased agronomical value. In
addition, mutant plants with defined changes in pectin synthesis can allow the
dissection of the biological role of each pectic component in plants. The pectin
biosynthetic genes provide valuable tools for understanding mechanistically how
pectin is synthesized. The glycosyltransferase-specific antibodies that can be
generated using the sequences disclosed herein are also within the scope of the
invention and allow the process of pectin assembly in the Golgi to be elucidated. A
complete understanding of such a polysaccharide cellular trafficking process is
unknown in any biological system.

Pectin is found in fruits and vegetables and is used as a gelling and
stabilizing agent in the food industry. Pectin has been shown to have multiple
beneficial effects on mammalian systems and on human health including the
inhibition of cancer growth and metastasis, inhibition of cancer metastasis by
binding of pectic oligosaccharides to cell surface receptors of cancer cells
(US5834442, US5895784), immunomodulatory effects and stimulation of tumor
necrosis factor by macrophages (EP03983113), interaction with mucous cell lining
of the duodenum and the prevention of ulcers (US4698229, US6024959), and
anti-complementary activity 125. Many cancer cells have specific carbohydrate-binding
protein molecules on their cell surfaces called galectins (galactoside-binding
lectins). Galectins aid in cellular interactions by binding to beta-galactose linked
molecules on neighboring cancer cells. Galectin-3 is a multifunctional lectin that is
involved in tumor cell adhesion, metastasis and cancer progression. Blocking
galectin-3 expression in malignant human breast, papillary and tongue carcinoma
cells led to reversion of the transformed phenotype and suppression of tumor
growth in nude mice 117-119. A pH-modified citrus pectin is suggested to block
binding of galectins and inhibit tumor cell adhesion. Pienta et al. 127 showed that
feeding of pH-modified pectin to rats caused a reduction in metastasis of prostate
cancer. Similarly, oral administration of pectin to mice carrying colon tumors
reduced tumor size compared to control animals 114, reduced metastatic colonization
of B16-F1 melanoma in the lung 120,121 and reduced human breast and colon
carcinoma growth, angiogenesis, and metastasis 125. When prostate cancer
patients were fed pH-modified citrus pectin, a 30% lengthening in prostate specific
antigen (PSA) doubling time was observed in 57% of the patients 122. As
progression of prostate cancer is evaluated based on the time that it takes for the
PSA to double, the above observations suggested that pectins may reduce tumor
size. It has also been shown that fruit-derived pectins inhibit the interaction of
fibroblast growth factor 1 (FGF1) with its receptor (FGFR1) 123. Defects in the FGF
signal transduction system are known to disturb cellular regulatory processes
resulting in cancer, cardiovascular disease and diabetes mellitus. The availability
of the gene(s) encoding galacturonosyltransferase allows the modification of
nutraceutical or pharmaceutical pectins to provide pectins with novel cell and
molecule binding activities and thus, with novel and specified anticancer and other
physiological activities.

In order to identify a gene(s) involved in pectin biosynthesis, the inventors
used a partial purification-tandem mass spectrometry approach to identify putative
GALAT genes from Arabidopsis (see Fig. 5 for strategy). GalAT from Arabidopsis
was partially purified from detergent-solubilized enzyme by sequential passage
over two or more of the following resins: cation exchange resin SP-Sepharose,
reactive green 19 resin, reactive blue 72 resin, reactive yellow 3 resin, and
UDP-agarose. Proteins obtained from selected fractions from these columns were
treated with trypsin to generate peptides, and the amino acid sequences of the
peptides were identified by liquid chromatography-tandem mass spectrometry. The
amino acid sequences thus generated were used to screen the Arabidopsis gene/protein
database. Thirty unique proteins were identified solely in the GalAT-containing
fractions (i.e. not present in fractions not containing GalAT activity). Among the 30
unique proteins that co-purified with GalAT activity, two proteins (designated JS33
and JS36) were initially identified as Arabidopsis putative GALAT proteins/genes
because each had at least one predicted transmembrane domain and
contained a predicted glycosyltransferase domain (see CAZy database;
http://afmb.cnrs-mrs.fr/CAZY/index.html).
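
The selection logic described above (proteins found only in GalAT-active fractions, then filtered for a predicted transmembrane domain and a predicted glycosyltransferase domain) can be sketched as follows. This is only an illustration of the filtering step: the identifiers P001 to P004 and every annotation value are hypothetical placeholders, not data from the study, and the function name is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    tm_domains: int   # number of predicted transmembrane domains
    gt_domain: bool   # True if a glycosyltransferase domain is predicted


def candidate_galats(active, inactive, annotations):
    """Return proteins seen only in GalAT-active fractions that have at least
    one predicted transmembrane domain and a predicted glycosyltransferase
    domain."""
    unique_to_active = set(active) - set(inactive)
    return sorted(
        p for p in unique_to_active
        if annotations[p].tm_domains >= 1 and annotations[p].gt_domain
    )


# Hypothetical example data (placeholders, not results from the source).
active_fractions = {"JS33", "JS36", "P001", "P002", "P003"}
inactive_fractions = {"P003", "P004"}
annotations = {
    "JS33": Annotation(tm_domains=1, gt_domain=True),
    "JS36": Annotation(tm_domains=1, gt_domain=True),
    "P001": Annotation(tm_domains=0, gt_domain=True),
    "P002": Annotation(tm_domains=2, gt_domain=False),
    "P003": Annotation(tm_domains=1, gt_domain=True),
    "P004": Annotation(tm_domains=0, gt_domain=False),
}

candidates = candidate_galats(active_fractions, inactive_fractions, annotations)
```

In this toy data set P003 is excluded because it also appears in a fraction lacking GalAT activity, even though its annotations would otherwise qualify.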

These two genes, along with another Arabidopsis gene with high sequence
similarity to JS36 (designated JS36L for JS36-like) (see below), were either cloned
by RT-PCR (JS36) using mRNA from Arabidopsis flower and stem tissue, or
obtained as a cDNA clone from the Arabidopsis Biological Resource Center (JS33
and JS36L). The proteins encoded by these genes each have a predicted single
transmembrane domain (Table III). The genes were truncated to remove their
N-terminal region including all or most of the predicted transmembrane domain (see
Table III), and the truncated genes were inserted into a mammalian expression
vector pEAK10 (Edge BioSystems as modified by Kelley Moremen lab, CCRC)
containing an N-terminal heterologous signal sequence (targeting the protein for
secretion into the medium), a polyhistidine (HIS) tag, and two influenza
hemagglutinin (HA) epitopes (useful for immunoabsorption).

Table III. Predicted characteristics of JS36, JS33 and JS36L proteins. Predictions were made
using information from the NCBI database and the SOSUI (Classic & Membrane Prediction
program) at BCM Search Launcher site
(http://searchlauncher.bcm.tmc.edu/seq-search/struc-predict.html).

The truncated forms of JS33, JS36 and JS36L, and the vector alone, were
transiently expressed in human embryonic kidney cells (HEK293 cells) for 46 hours.
Since the translational fusion proteins constructed contained two copies of the HA
epitope, the culture medium was collected and a portion was treated with a mouse
anti-HA IgG1 bound either to Protein A Sepharose or Protein G Sepharose. The
immunoadsorbed protein was assayed for GalAT activity using UDP-[14C]GalA and
a mixture of OGA acceptors. Figure 8 shows that the JS36 construct expressed a
protein exhibiting GalAT activity. These studies establish that JS36 is a GalAT and
thus we designated the gene GALAT1.

As mentioned above, analysis of the amino acid sequence of GALAT1
shows that the expressed protein contains one transmembrane domain. This is in
agreement with the GalAT activity being membrane bound in all species tested (see
Mohnen et al. (2002) 9). Furthermore, the predicted topology of GALAT1 is that of a
type-II membrane protein, in agreement with our previous determination that the
catalytic site of pea GalAT lies in the lumen of the Golgi. Type-II membrane
proteins have a short N-terminal cytosolic tail, a transmembrane region, a stem
region, and a C-terminal catalytic domain 16.
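
The single N-terminal transmembrane region can be illustrated with a simple Kyte-Doolittle hydropathy scan. This is a minimal sketch, not the SOSUI method used for Table III: the 19-residue window and the cutoff of about 1.6 are the commonly used Kyte-Doolittle heuristic and are assumptions here, as is the function name. The input is the first 60 residues of the GALAT1 amino acid sequence listed in this document.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def most_hydrophobic_window(seq, window=19):
    """Return (1-based start, mean hydropathy) of the most hydrophobic window."""
    seq = seq.upper()
    if len(seq) < window:
        raise ValueError("sequence shorter than window")
    best_start, best_score = 0, float("-inf")
    for start in range(len(seq) - window + 1):
        score = sum(KD[aa] for aa in seq[start:start + window]) / window
        if score > best_score:
            best_start, best_score = start, score
    return best_start + 1, best_score


# Residues 1-60 of GALAT1 (SEQ ID NO:2) as listed in this document.
N_TERM = "malkrglsgvnrirgsgggsrsvlvlliffcvfaplcffvgrgvyidssndysivsvkqn"

tm_start, tm_score = most_hydrophobic_window(N_TERM)
```

Under these assumptions the most hydrophobic window starts at residue 23 with a mean hydropathy well above 1.6, which is consistent with a short cytosolic tail followed by one membrane-spanning segment.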

GALAT1 is a member of the Glycosyltransferase Family 8 in the CAZy
database [database of putative and proven carbohydrate modifying enzymes that
currently contains 61 different proposed glycosyltransferase families
(http://afmb.cnrs-mrs.fr/CAZY/index.html) 66-67]. The presence of GALAT1 in Family
8 is in agreement with our demonstrated activity of GALAT1 as an
α1,4-galacturonosyltransferase, since Family 8 is a family of proposed retaining
glycosyltransferases and GALAT1 is a retaining enzyme, i.e., the α-configuration in
the substrate UDP-α-GalA is retained in the product α1,4-linked galacturonan
(HGA).

GALAT1 is expressed in multiple Arabidopsis tissues at multiple times during
development. We base this on our RT-PCR analysis of RNA from Arabidopsis
flower, root, stem and leaf tissue (Figs. 6A and 6B) showing that GALAT1 is
expressed in all these tissues, and on the 18 EST entries for this gene in the
TAIR database (http://www.arabidopsis.org/) indicating that GALAT1 is expressed
in developing seed, green siliques, roots and above ground organs.

Identification of the GALAT1 Gene Family

A standard protein blast and a PSI Blast of the NCBI protein database using 
the GALAT1 (JS36) amino acid sequence reveal that GALAT1 is a member of an at
least 15 member GALAT gene family in Arabidopsis (see Table IV). The genes
selected for this family have at least 30% amino acid identity and at least 50% 
amino acid similarity based on the PSI Blast. We further compared these genes 
along their entire coding sequences with JS36 using a Pairwise BLAST (Table IV) 
and show that this family of genes has at least 34% identity and at least 52% 
similarity to JS36 in the portion of the genes C-terminal to the membrane spanning 
domain. This identity is comparable to the 37-54% identity shared among the 
proposed ten member Arabidopsis fucosyltransferase gene family (AtFU1-10) 71 . 
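
The percent identity and percent similarity figures used above can be illustrated with a small calculation over an already aligned pair of sequences. This is a simplified sketch: BLAST counts a position as similar ("positive") when its BLOSUM62 score is positive, whereas the code below uses a short list of conservative substitution groups as a stand-in. The two aligned fragments are made up for illustration and the function names are assumptions.

```python
# Conservative substitution groups: a simplified stand-in for positive
# BLOSUM62 scores (an assumption of this sketch, not the BLAST definition).
SIMILAR_GROUPS = ["ILVM", "FYW", "KRH", "DE", "NQ", "ST", "AG"]


def _similar(a, b):
    return any(a in group and b in group for group in SIMILAR_GROUPS)


def identity_similarity(aln_a, aln_b):
    """Return (% identity, % similarity) over the full alignment length,
    counting gap columns ('-') as neither identical nor similar."""
    if len(aln_a) != len(aln_b) or not aln_a:
        raise ValueError("aligned sequences must be non-empty and equal length")
    identical = similar = 0
    for a, b in zip(aln_a.upper(), aln_b.upper()):
        if a == "-" or b == "-":
            continue
        if a == b:
            identical += 1
            similar += 1
        elif _similar(a, b):
            similar += 1
    n = len(aln_a)
    return 100.0 * identical / n, 100.0 * similar / n


def meets_family_cutoffs(identity, similarity):
    """Cutoffs stated in the text: at least 30% identity and 50% similarity."""
    return identity >= 30.0 and similarity >= 50.0


# Hypothetical aligned fragments, for illustration only.
pct_id, pct_sim = identity_similarity("LFLDDDIIVQK", "LFIDEDV-VQR")
in_family = meets_family_cutoffs(pct_id, pct_sim)
```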

Mutant studies provide further evidence that the GalAT family encodes 
GalATs involved in pectin synthesis. We recently used seed received from the
SIGnAL Arabidopsis T-DNA mutant collection (http://signal.salk.edu/cgi-bin/tdnaexpress)
to identify and generate six homozygous T-DNA insertion mutant lines for
several members of the GalAT family. We found that one GalAT
family gene, At1g06780, when mutated, produces leaves with cell walls that contain
reduced amounts of galacturonic acid. Specifically, analysis of walls from 
homozygous mutant line 073484 revealed that the walls had an 18% reduction in 
GalA and a concomitant increase in glucose. None of the other sugars changed. 
Of the three available At1g06780 T-DNA insert lines, no homozygous seed was 
recovered from mutants where the T-DNA was inserted into an exon. Rather, seed 
recovered from such lines had a reduced germination rate. In line 073484, 
however, the T-DNA is inserted in the 5'-UTR, suggesting that it may have a leaky 
phenotype. The results are consistent with gene At1g06780 encoding a GalAT and 
with the identification of the gene family as a GalAT gene family. The GalA content 
of the walls of another Arabidopsis mutant (Quasimodo) is reduced by 25% and 
these plants exhibit decreased cell adhesion 55 , characteristics consistent with the 
Quasimodo gene encoding a GalAT. Quasimodo has 53% amino acid identity and 
72% similarity to GALAT1 and the gene affected in Quasimodo (At3g25140) is a 
member of our proposed GalAT family. There is, however, at present no direct 
enzymatic evidence that the protein encoded by Quasimodo is a functional GalAT. 

The conserved amino acids in the GALAT gene family are shown in Fig. 7.
Glycosyltransferases are expected to contain one or more carboxylates at the
catalytic site. At least one of the carboxylates is expected to coordinate a divalent
cation associated with the nucleotide-sugar. In many glycosyltransferases the
metal coordination involves two carboxylates that are often present as DDx, xDD,
or DDD (the so-called "D(x)D" motif) 72.
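
A scan for candidate D(x)D motifs can be sketched with a regular expression. The fragment below corresponds to residues 491-510 of the GALAT1 amino acid sequence listed in this document; the function name and the choice of a plain D-any-D pattern are assumptions, and a sequence match alone does not show that a site binds metal.

```python
import re

# Lookahead so that overlapping matches (e.g. within "DDDD") are all reported.
DXD = re.compile(r"(?=(D.D))")


def find_dxd(seq, first_residue=1):
    """Return (residue number of the first D, matched triplet) for each D-x-D."""
    return [
        (first_residue + m.start(), m.group(1))
        for m in DXD.finditer(seq.upper())
    ]


# Residues 491-510 of GALAT1 (SEQ ID NO:2) as listed in this document.
fragment = "lnkilflddd" + "iivqkdltpl"
hits = find_dxd(fragment, first_residue=491)
```

In this fragment the only hit is the DDD triplet beginning at residue 498.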

A PSI Blast against the GALAT1 gene (JS36) further identified 10 genes that
have high sequence identity (23-29%) and similarity (41-51%) to GALAT1 and form
a tight cluster of highly similar genes (55-66% identity/67-77% similarity). A
Neighbor Joining Tree of our proposed Arabidopsis GalAT Superfamily (i.e. the
proposed GALAT family and the GALAT-like family), based on a sequence
alignment generated by ClustalX 128, is shown in Fig. 9. The 10 GALAT-like genes
are all significantly smaller, lacking ~200 amino acids in comparison with the
GALAT family. Nonetheless, they appear to be targeted to the secretory pathway
based on annotation of the genes at The Arabidopsis Information Resource. All 10
genes appear to be expressed in Arabidopsis, since they are represented by one or
more ESTs in the Arabidopsis EST collection. The GALAT-like genes also contain

some of the same conserved residues as the GalAT family, namely D-D D— L 

(the predicted "D(x) D" motif) and L F -W--GLG H— G— 

KPW. We group the 10 GALAT-like genes into a family that encodes GalATs directly
involved in pectin synthesis or GalATs with, as yet, unidentified glycosylating
function.

Table IV. Pairwise sequence alignment between JS36 and the other members of proposed GALAT
gene family. The alignment was done using the NCBI Pairwise BLAST and Matrix Blosum62. The
% amino acid identity and similarity are shown. In all cases the alignment compares the bulk of the
C-terminal portion of the proteins on the carboxy-terminal side of the transmembrane region.

The expression of the GALAT1 gene in transiently transfected mammalian
cells as demonstrated herein now allows the production of stably transformed cell
lines that produce GALAT1 and experiments aimed at characterizing the
mechanism of the enzyme and at determining the role of GALAT1 in pectin
synthesis. Specifically, the substrate specificity of GALAT1 will indicate whether it
catalyzes only HGA synthesis, or also plays a role in RG-I and RG-II synthesis.
Characterization of the kinetics of GALAT1 can clarify whether or not UDP-GalA is
both a substrate and an allosteric regulator of the enzyme. Characterization of
mutated GALAT1 enzymes can provide information regarding amino acids important in
catalysis and substrate binding. The subcellular location of GALAT1 will provide
the first framework for where, within the Golgi and plant endomembrane system the 
complex series of pectin biosynthetic reactions occur. The invention can further be 
used to generate transgenic plants with modified pectin, which can provide 
information regarding the role of GALAT1 in pectin synthesis, provide novel 
biosynthesis acceptors, and provide information about the role of pectin in plant 
growth and development. This biosynthesis framework allows further identification 
of GALAT1 binding proteins that would be putative pectin biosynthesis complex 
members. The results of these studies can serve as the foundation for a full in vitro 
reconstitution of functional pectin synthesis complexes. 

GALAT1 has high sequence similarity to 14 other Arabidopsis proteins as 
shown in Table IV and to proteins expressed in other plants. Possible GALAT1 
homologs in other plants are a 68 kDa protein expressed in Cicer arietinum
(chickpea) epicotyls (76% amino acid identity; 87% similarity), a hypothetical
protein from Oryza sativa (japonica) (59% identity; 75% similarity) and a protein
from Populus alba (49% identity; 72% similarity). Thus, the results from the study 
of GALAT1 in Arabidopsis can be extended to other plants, including those of high 
agricultural value. 

Heterologous expression of GALAT1 

As described above, the medium from human embryonic kidney (HEK293)
cells transiently transfected with a recombinant expression vector bearing truncated
GALAT1 contained the expressed GALAT1. Whereas transient expression allowed the
expression of sufficient GALAT1 to measure GalAT activity, additional expression
strategies can be readily devised to produce the large quantities of GALAT1 required
for further characterization of the enzyme and for antibody production. Since the
transiently expressed N-terminal epitope-tagged GALAT1 expressed in mammalian
cells was active, one strategy is to produce stably transfected clonal HEK293
lines 75 expressing the same protein. The alternative strategy is to express the full
length and N-terminal truncated forms of GALAT1 in the fungal expression system
Pichia pastoris. These systems were chosen since we and others 56-58 have
successfully used them to express plant glycosyltransferases.

For expression in P. pastoris, cDNA encoding the entire, and the truncated 
soluble forms of GALAT can be generated by PCR using gene/vector specific 
primers. The PCR products are then subcloned into appropriate Pichia expression 
vectors (Invitrogen, Carlsbad, CA) in which the cDNA is inserted downstream from 
an alcohol oxidase (AOX1) promoter. We have made full length coding sequence 
constructs for expression in the Pichia vector pPIC3.5. This vector does not
contain an epitope tag. One can easily make epitope tagged GALAT1 constructs in
the Pichia vectors pPICZ and pPICZα (Invitrogen) and determine whether functional
C-terminal epitope-tagged constructs that do not affect GalAT activity can be
recovered. Several studies have demonstrated success of the Pichia system 76-82.
Once a high-GALAT1-producing line is recovered, production of large amounts of
protein can be carried out in fermentors or spinner flasks. 

Characterization of Expressed GALAT1 

To begin to address how HGA is synthesized, the kinetics, substrate specificity, 
and structure of the purified recombinantly expressed GALAT1 can be determined 
and compared to the solubilized membrane-bound Arabidopsis GALAT purified by
immunoadsorption using the polyclonal anti-GALAT1 antibody (see below). Although the
characteristics of GALAT1 are consistent with the enzyme being the/a catalytic
subunit of the HGA synthase, GALAT1 could be a GalAT involved in RG-II or RG-I
synthesis. For example, GALAT1 could represent an RG-I:GalAT that initially
elongates HGA by a single GalA and then waits for a required NDP-Rha to start
RG-I backbone synthesis. The kinetics of purified and recombinantly expressed
GALAT1 for UDP-GalA and a size range of homogalacturonan and pectin
acceptors can be determined. The effect of other nucleotide-sugars and
oligosaccharide substrates on GalAT can also be tested to identify activators and
inhibitors.

The expressed full length and truncated enzymes can be assayed in a reaction
buffer in the presence, and absence, respectively, of Triton X-100. The kinetics of
the enzyme for UDP-GalA can be determined using a total of 1 µM to 80 mM UDP-GalA
+ UDP-[14C]GalA. We routinely synthesize UDP-[14C]GalA either by the
4-epimerization of UDP-[14C]GlcA 1 or oxidation of UDP-[14C]Gal 84 since
UDP-[14C]GalA is not commercially available. The effect of different acceptors on
GALAT1 activity can be tested using 100 µM UDP-GalA and 0.1-100 µg
acceptor/30 µl reaction. The acceptors to be tested include HGA oligosaccharides
(oligogalacturonides) of degrees of polymerization ranging from 2-16,
polygalacturonic acid, commercially available citrus pectin of ~30, 60 and 90%
esterification, RG-I and RG-II. The products made using the different acceptors
can be characterized 2,3. If RG-I is shown to serve as an acceptor, RG-I backbone
fragments that have a GalA or a Rha at the non-reducing end can be used to
determine acceptor specificity. The acceptors can be tested using multiple assays
including the precipitation assay 2 and a filter assay 63. The enzymes can also be
tested for the effect of pH, temperature, reducing agents, divalent cations and salts
on enzyme activity and product structure.
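
As an illustration of how Km and Vmax might be estimated from such a substrate series, the sketch below fits a Hanes-Woolf line (S/v plotted against S). It assumes simple Michaelis-Menten behavior, which may not hold if the enzyme shows biphasic or allosteric kinetics. The concentrations span the stated 1 µM to 80 mM range, but the Km and Vmax values used to generate the synthetic rates are arbitrary placeholders, and the function names are assumptions.

```python
def michaelis_menten(s, vmax, km):
    """Initial rate predicted by the Michaelis-Menten equation."""
    return vmax * s / (km + s)


def hanes_woolf_fit(concs, rates):
    """Least-squares fit of S/v = S/Vmax + Km/Vmax; returns (Vmax, Km)."""
    xs = list(concs)
    ys = [s / v for s, v in zip(concs, rates)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals 1 / Vmax
    intercept = mean_y - slope * mean_x  # equals Km / Vmax
    return 1.0 / slope, intercept / slope


# UDP-GalA concentrations in µM, spanning 1 µM to 80 mM.
concs_um = [1, 10, 100, 1_000, 10_000, 80_000]

# Synthetic, noise-free rates from placeholder parameters (not measured data).
TRUE_VMAX, TRUE_KM = 100.0, 50.0
rates = [michaelis_menten(s, TRUE_VMAX, TRUE_KM) for s in concs_um]

vmax_est, km_est = hanes_woolf_fit(concs_um, rates)
```

With real, noisy measurements a nonlinear fit would usually be preferred; the linear form is used here only because it needs no external libraries.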

Characteristics of the recombinant truncated GALAT1 can be compared to
the GALAT1 solubilized from Arabidopsis membranes by immunoadsorption of the
solubilized GALAT1 using anti-GALAT1 antibody (see section below) bound to
Protein A or G Sepharose, or by coupling the anti-GALAT1 antibodies to
3M-Emphaze resin 86 and using the resin to purify GALAT1 from solubilized
Arabidopsis enzyme. If the characteristics of the immunoadsorbed Arabidopsis
GALAT1 are different from those of the recombinant truncated GALAT1, the
immunoadsorbed GALAT1 can be analyzed by LC tandem mass spectrometry to
determine if additional proteins are immunoadsorbed with the Arabidopsis
solubilized GALAT1 that may have modified the activity (e.g. a heteromeric
complex).

The recombinant GALAT1 and the GALAT1 immunoadsorbed from
Arabidopsis solubilized membranes can also be treated with N-glycanase to
determine if they are N-glycosylated. To determine if they are O-glycosylated, the
proteins can be exhaustively treated with N-glycanase, the released
oligosaccharides removed, and the resulting protein analyzed by TMS methylation
analysis to determine the glycosyl residue composition of any carbohydrates still
attached to the protein. Any oligosaccharide released by the N-glycanase
treatment can also be analyzed by TMS methylation. The results of these
experiments would indicate whether the native Arabidopsis GalAT is glycosylated
and whether the recombinant forms have the same or different glycosylation
pattern. Changes in glycosylation could affect GALAT1 enzyme activity and/or
substrate binding. GALAT1 is predicted to have 5 or 6 N-glycosylation sites
(NetNGlyc 1.0 Prediction; http://www.expasy.org/sitemap.html).

As mentioned above, we have found that membrane-bound and solubilized
GalAT activity in tobacco and radish has unusual apparent biphasic kinetics. Thus,
we are particularly interested in determining if the expressed GALAT1 shows the
same kinetics, including possible allosteric regulation by UDP-GalA. One can test
for possible multimeric structure by determining the mass of the enzyme by size
exclusion chromatography and comparing this with the mass obtained by
SDS-PAGE. The possibility that GALAT1 exists as a heteromultimer can be tested by
mixing expressed recombinant GALAT1 with solubilized Arabidopsis enzymes and
immunoadsorbing GALAT1 and proteins bound to it using either an anti-GALAT1
antibody or an anti-HA epitope antibody (see previous section).
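
The comparison of native mass (size exclusion chromatography) with denatured subunit mass (SDS-PAGE) reduces to a simple ratio. The sketch below assumes a homomultimer and uses placeholder masses; neither the numbers nor the function name come from the source.

```python
def estimate_subunit_count(native_kda, subunit_kda):
    """Return (rounded subunit count, raw ratio) for a presumed homomultimer."""
    if native_kda <= 0 or subunit_kda <= 0:
        raise ValueError("masses must be positive")
    ratio = native_kda / subunit_kda
    return round(ratio), ratio


# Placeholder masses in kDa, for illustration only.
count, ratio = estimate_subunit_count(native_kda=150.0, subunit_kda=75.0)
```

A ratio far from a whole number, or a native mass that cannot be explained by the subunit alone, would instead point toward a heteromeric complex or a non-globular shape.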

Production of a series of mutated GALAT1 proteins by site-directed mutagenesis

As discussed above, there are 45 conserved amino acids in GALAT1 among 
the 15 members of the GALAT family. To determine the role of these residues in 
30 substrate/acceptor binding and/or catalysis, each amino acid is systematically 
mutated using site-directed mutagenesis. The effect of these mutations on 
GALAT1 specific activity, and where warranted, on Km, Vmax, and acceptor 
specificity (i.e. OGA, RG-I and RG-II) and product size (i.e. enzyme processivity) is 
determined. 
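
Designing a point mutant amounts to replacing one codon in the coding sequence. The sketch below swaps a single codon in a fragment of the GALAT1 coding sequence listed in this document (nucleotides 1471-1500, encoding residues 491-500). The choice of residue 498 and of an alanine codon is for illustration only and does not imply which residues are among the conserved positions shown in Fig. 7; the function name is an assumption.

```python
def mutate_codon(cds, residue, new_codon, first_residue=1):
    """Return cds with the codon for `residue` replaced by `new_codon`.

    `first_residue` is the residue number encoded by the first codon of `cds`,
    so that a fragment can be addressed with full-length numbering.
    """
    if len(new_codon) != 3:
        raise ValueError("new_codon must be three nucleotides")
    i = (residue - first_residue) * 3
    if i < 0 or i + 3 > len(cds):
        raise ValueError("residue lies outside the supplied sequence")
    return cds[:i] + new_codon.lower() + cds[i + 3:]


# Nucleotides 1471-1500 of Sequence #1 (codons for residues 491-500).
fragment = "ctgaacaaaatcctcttcctggacgatgac"

# Illustrative Asp-to-Ala change at residue 498 (gac -> gct).
mutant = mutate_codon(fragment, residue=498, new_codon="gct", first_residue=491)
```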

Production and use of antibodies

Anti-GalAT antibodies are necessary for the immunocytochemistry 
experiments, to immunopurify solubilized GALAT1 from Arabidopsis, and to select 
proteins that potentially bind to GALAT1 and may function in pectin biosynthetic 
enzyme complexes. A skilled artisan can generate anti-GalAT antibodies using the 
nucleic acid or amino acid sequences disclosed herein. This can be accomplished 
by employing the heterologously expressed truncated or full-length GALAT1. 
Alternatively, a small peptide derived from the GALAT1 sequence can be 
synthesized and used to generate anti-GALAT1 antibodies. One can generate 
either polyclonal or monoclonal antibodies. Such antibodies are useful for a range 
of types of experiments, including subcellular immunocytochemistry, 
immunoprecipitation/adsorption, and enzyme activity inhibition studies. Monoclonal 
or polyclonal antibodies, specifically reacting with a protein of interest can be made 
by methods well known in the art. See, e.g., Harlow and Lane (1988) Antibodies: A
Laboratory Manual, Cold Spring Harbor Laboratories; Goding (1996) Monoclonal 
Antibodies: Principles and Practice, 3rd ed., Academic Press, San Diego, CA, and 
Ausubel et al. (1993) Current Protocols in Molecular Biology, Wiley 
Interscience/Greene Publishing, New York, NY. 

Subcellular localization of GALAT1 

All available data, including the localization of the catalytic domain of GalAT 
in the Golgi lumen 7 , suggest that pectin is synthesized in the Golgi and transferred 
via vesicles to the wall. However, it is not known how the different 
glycosyltransferases function to make specific pectin structures. We predict that 
different glycosyltransferases are localized in a sequential manner to different 
cisternae of the Golgi 22,91 in an order indicative of the order in which pectin is 
synthesized as it moves from the cis, through the medial and to the trans Golgi. 
Evidence from both animals 92,93 and plants 94 suggests that, either individually or in
combination, the transmembrane domain (i.e. the bilayer thickness model 95 ), the N- 
or C-terminal sequences flanking the transmembrane domain, and/or the lumenal 
domain (i.e. the 'kin recognition model' 96 ) contribute to localization of proteins within 
the Golgi system. The anti-GalAT antibodies generated as described above can be 
used to determine the subcellular localization of GALAT1 within the Golgi in order 
to provide additional information on the role of GALAT1 in pectin synthesis. For
example, a location of GALAT1 in the cis and medial Golgi cisternae would be
consistent with a function of GALAT1 in HGA synthesis, while a localization
primarily in the late medial or trans Golgi would be more suggestive of a role in
RG-I or possibly RG-II synthesis. It should be noted that such subcompartment
localization studies, while important and novel for the pectin biosynthetic enzymes,
are also novel in any species since the "precise location of only a small number of
the glycosyltransferase proteins within the Golgi apparatus have been
determined" 93. Anti-GALAT1 antibodies can be used to identify where in the Golgi
GALAT1 is localized by, for example, immunogold labeling of thin sections from
Arabidopsis 97,91,98,99 including both developing Arabidopsis seedlings and growing
suspension cultures which have cells actively making wall.

Use of mutants and RNAi to generate and characterize GALAT1 and GalAT gene
superfamily knockouts

Double-stranded RNA-mediated interference (RNAi) is a method to study the 
function of genes in plants 100 . Transgenic plants harboring an RNAi construct often 
have reduced expression of the gene-specific mRNA. The resulting plants may 
display either complete gene silencing, thus having a knockout phenotype, or a 
partial "knockout" phenotype due to 'leaky' expression. The RNAi approach should 
allow the suppression of GALAT1 expression and a reduction or loss of GALAT1.
This enables one to elucidate the function of GALAT in pectin synthesis and in the
plant. Simultaneously, the sequence-indexed T-DNA insertion mutants listed in the
Salk Institute Genomic Analysis Laboratory (SIGnAL) Arabidopsis T-DNA mutant
collection (http://signal.salk.edu/cgi-bin/tdnaexpress) can be monitored to determine
if any T-DNA insert lines for GALAT become available. If so, the seed can be 
obtained and the mutants generated therefrom can be characterized (as described 
above). 

The putative pectin biosynthesis mutants can aid in the identification of gene 
function in two ways. The visible phenotypes of the mutants can provide 
information on the biological function of the gene (if there is no redundancy in gene 
function) by demonstrating when during growth and development the particular 
gene product is needed (as shown above). Structural analysis of the pectin in the
mutant walls can provide information about the specific enzyme activity of the gene
in pectin synthesis (as shown above).

Of particular importance regarding pectin synthesis, the cell walls are isolated
and analyzed for glycosyl residue composition (see above) and linkage to provide
information about the possible role of GALAT1 in pectin synthesis.

Identification of the members of HGA biosynthetic complexes

There is growing evidence that glycoconjugates are synthesized by 
complexes of glycosyltransferases and other types of proteins 102 . For example, 
ganglioside synthesis occurs via a tightly regulated formation of multiple 
glycosyltransferase complexes 102. Thus, any protein members of HGA biosynthetic
complexes can be isolated by immunoadsorbing such proteins bound to GALAT1
using anti-GALAT1 antibodies or anti-HA epitope antibodies. The
immunoadsorbed proteins can be identified by SDS-PAGE, removed from the gel,
and their amino acid sequence determined by LC-tandem mass spectrometry. The
amino acid sequences thus obtained can then be used to search the available
protein databases for their identities.

Characterization of mutant phenotypes and bulking up of seed

A person of ordinary skill in the art can use mutant seeds to probe gene
function. For example, the initial mutant seed (often a segregating T3 line, see
http://signal.salk.edu/tdna_FAQs.html) can be grown and selfed to increase the
seed stock (T4). Multiple plants from T4 seed can be grown and the presence of,
for example, the T-DNA insert determined by PCR of plant genomic DNA using a
T-DNA primer and a gene specific primer. The same DNA can be analyzed with gene
specific primers that should span the T-DNA insertion site. These analyses should
indicate whether the given plant contains a T-DNA insert and if so, whether it is
homozygous or heterozygous for the mutation. If necessary, Southern blotting and
hybridization with the specific genes can be used to determine if the gene contains
the expected T-DNA insert. Seed homozygous for the T-DNA insertion (when not
lethal) or heterozygous (when no viable T-DNA homozygous plants are obtained)
can be selfed to amplify the seed and, for heterozygous plants, to test for
segregation of any phenotype or T-DNA insert. Plants can be scored as
heterozygous or homozygous by PCR analysis of the T-DNA insert and by any
visible phenotype. Homozygous or heterozygous plants can be used for growth
phenotype and cell wall analysis. The seed can also be crossed with wild type
Columbia and then selfed to eliminate the possibility that the lines contain an
unexpected mutation or additional T-DNA insert(s).
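
The two-reaction PCR scoring described above can be written as a small decision rule. The sketch assumes that the gene-specific primer pair spanning the insertion site yields a product only from an uninterrupted (wild type) allele, because the T-DNA insert is too large to amplify across under the conditions used. That assumption and the function name are not from the source.

```python
def score_genotype(insert_band, wild_type_band):
    """Classify a plant from two PCR results.

    insert_band:    product seen with T-DNA primer + gene-specific primer
    wild_type_band: product seen with gene-specific primers spanning the site
    """
    if insert_band and wild_type_band:
        return "heterozygous"
    if insert_band:
        return "homozygous"
    if wild_type_band:
        return "wild type"
    return "inconclusive"  # neither reaction worked; repeat the PCR


# Hypothetical gel results for four plants: (insert band, wild type band).
results = {
    "plant 1": (True, True),
    "plant 2": (True, False),
    "plant 3": (False, True),
    "plant 4": (False, False),
}
calls = {plant: score_genotype(*bands) for plant, bands in results.items()}
```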

Growth Phenotype analysis 

Several growth parameters of the mutant and wild type plants are recorded
to yield a general phenotypic characterization of the mutant plants 134.

Analysis of Cell Walls 

Homozygous or heterozygous plants are grown and analyzed for wall
composition and linkage. Cell walls can, for example, be prepared as alcohol
insoluble residues (AIRs) from WT and (homozygous) mutant Arabidopsis plant
tissues 135. AIRs are prepared by homogenizing leaves and stems (from soil-grown
plants) and roots (from liquid-cultured plants) in aqueous 80% EtOH followed by
washes with absolute EtOH, chloroform-methanol, and acetone. Separate fractions
containing RG-I, RG-II and oligogalacturonides can be obtained by size-exclusion
chromatography (SEC) and ion exchange chromatography of the material
solubilized from the cell walls by treatment with pectin methyl esterase (PME) and
endo-polygalacturonase (EPG). The yields, glycosyl residue compositions, and
glycosyl linkage compositions of each fraction can be determined 27.

The nucleotide and amino acid sequences of the fifteen GALAT gene family
members are shown as follows. 

Sequence #1 (SEQ ID NO:1)

Gene name: At3g61130

GenBank accession # for reference: NM_115977 GI:18411855
Nucleotide sequence of Sequence #1:
Positions 1-2022 of CDS of NM_115977.

10 1 atggcgctaa agcgagggct atctggagtt aaccggatta gaggaagtgg tggtggatct 

61 cgatctgtgc ttgtgcttct catatttttc tgtgtttttg cacctctttg cttctttgtt 
121 ggccgaggag tgtatatcga ttcctcaaat gattattcaa ttgtttctgt gaagcagaat 
181 cttgactgga gagaacgttt agcaatgcaa tctgttagat ctcttttctc gaaagagata 
241 ctagatgtta tagcaaccag cacagctgat ttgggtcctc ttagccttga ttcttttaag 

15 301 aaaaacaatt tgtctgcatc atggcgggga accggagtag acccctcctt tagacattct 
361 gagaatccag caactcctga tgtcaaatct aataacctga atgaaaaacg tgacagcatt 
421 tcaaaagata gtatccatca gaaagttgag acacctacaa agattcacag aaggcaacta 
481 agagagaaaa ggcgtgagat gcgggcaaat gagttagttc agcacaatga tgacacgatt 
541 ttgaaactcg aaaatgctgc cattgaacgc tctaagtctg ttgattctgc agtccttggt 

20 601 aaatacagta tttggagaag agaaaatgag aatgacaact ctgattcaaa tatacgcttg 
661 atgcgggatc aagtaataat ggctagagtc tatagtggga ttgcaaaatt gaaaaacaag 
721 aacgatttgt tacaagaact ccaggcccga cttaaggaca gccaacgggt tttgggggaa 
781 gcaacatctg atgctgatct tcctcggagt gcgcatgaga aactcagagc catgggtcaa 
841 gtcttggcta aagctaagat gcagttatat gactgcaagc tggttactgg aaagctgaga 

25 901 gcaatgcttc agactgccga cgaacaagtg aggagcttaa agaagcagag tacttttctg 
961 gctcagttag cagcaaaaac cattccaaat cctatccatt gcctatcaat gcgcttgact 
1021 atcgattact atcttctgtc tccggagaaa agaaaattcc ctcggagtga aaacctagaa 
1081 aaccctaatc tttatcatta tgccctcttt tccgacaatg tattagctgc atcagtagtt 
1 141 gttaactcaa ccatcatgaa tgccaaggat ccttctaagc atgtttttca ccttgtcacg 

30 1201 gataaactca atttcggagc aatgaacatg tggttcctcc taaacccacc cggaaaggca 
1261 accatacatg tggaaaacgt cgatgagttt aagtggctca attcatctta ctgtcctgtc 
1321 cttcgtcagc ttgaatctgc agcaatgaga gagtactatt ttaaagcaga ccatccaact 
1381 tcaggctctt cgaatctaaa atacagaaac ccaaagtatc tatccatgtt gaatcacttg 
1441 agattctacc tccctgaggt ttatcccaag ctgaacaaaa tcctcttcct ggacgatgac 

35 1501 atcattgttc agaaagactt gactccactc tgggaagtta acctgaacgg caaagtcaac 
1561 ggtgcagtcg aaacctgtgg ggaaagtttc cacagattcg acaagtatct caacttttcg 
1621 aatcctcaca ttgcgaggaa cttcaatcca aatgcttgtg gatgggctta tggaatgaac 
1681 atgttcgacc taaaggaatg gaagaagaga gacatcactg gtatatacca caagtggcaa 
1741 aacatgaatg agaacaggac actatggaag ctagggacat tgccaccagg attaataaca 

1801 ttctacggat taacacatcc cttaaacaag gcgtggcatg tgctgggact tggatataac
1861 ccgagtatcg acaagaagga cattgagaat gcagcagtgg ttcactataa cgggaacatg 
1921 aaaccatggt tggagttggc aatgtccaaa tatcggccgt attggaccaa gtacatcaag 
1981 tttgatcacc catatcttcg tcgttgcaac cttcatgaat aa 

Amino Acid Sequence of Sequence #1: (SEQ ID NO:2)
GeneBank ID# NP_191672 
Positions 1-673 of NP_191672. 

1 malkrglsgv nrirgsgggs rsvlvlliff cvfaplcffv grgvyidssn dysivsvkqn
61 ldwrerlamq svrslfskei ldviatstad lgplsldsfk knnlsaswrg tgvdpsfrhs
121 enpatpdvks nnlnekrdsi skdsihqkve tptkihrrql rekrremran elvqhnddti
181 lklenaaier sksvdsavlg kysiwrrene ndnsdsnirl mrdqvimarv ysgiaklknk
241 ndllqelqar lkdsqrvlge atsdadlprs aheklramgq vlakakmqly dcklvtgklr
301 amlqtadeqv rslkkqstfl aqlaaktipn pihclsmrlt idyyllspek rkfprsenle
361 npnlyhyalf sdnvlaasvv vnstimnakd pskhvfhlvt dklnfgamnm wfllnppgka
421 tihvenvdef kwlnssycpv lrqlesaamr eyyfkadhpt sgssnlkyrn pkylsmlnhl
481 rfylpevypk lnkilflddd iivqkdltpl wevnlngkvn gavetcgesf hrfdkylnfs
541 nphiarnfnp nacgwaygmn mfdlkewkkr ditgiyhkwq nmnenrtlwk lgtlppglit
601 fyglthplnk awhvlglgyn psidkkdien aavvhyngnm kpwlelamsk yrpywtkyik
661 fdhpylrrcn lhe
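
The correspondence between a listed coding sequence and its amino acid sequence can be checked by translating the nucleotides with the standard genetic code. The following is only a minimal sketch of such a check, not part of the disclosure: the helper names `clean` and `translate` are hypothetical, and it assumes the listing uses lowercase bases with position numbers and spaces as the only non-sequence characters.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated with base order t, c, a, g
# (first base varies slowest). "*" marks a stop codon.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(listing):
    """Remove position numbers, spaces and line breaks from a numbered listing."""
    return re.sub(r"[^a-z]", "", listing.lower())


def translate(cds):
    """Translate a codon-aligned nucleotide string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues).lower()


# First and last nucleotide lines of Sequence #1 as listed above.
first_line = "1 atggcgctaa agcgagggct atctggagtt aaccggatta gaggaagtgg tggtggatct"
last_line = "1981 tttgatcacc catatcttcg tcgttgcaac cttcatgaat aa"

print(translate(clean(first_line)))
print(translate(clean(last_line)))
```

Because each line of the listing starts at a position of the form 3k + 1, a single line can be translated on its own, which makes it easy to compare one nucleotide line against the matching stretch of the amino acid listing.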



Sequence #2 (SEQ ID NO:3)
Gene name: At2g38650 

GeneBank accession # for reference: NM_129422 GI:30687590
Nucleotide sequence of Sequence #2:
Positions 1-1860 of CDS of NM_129422.

1 atgaaaggcg gaggcggtgg tggaggaggt ggtggcggag gaaaacgccg gtggaaagtt 
61 ctggtgattg gagttttggt tcttgttatt ctttctatgc ttgttcctct tgctttctta 
121 ctcggtcttc acaatggctt tcactctcct ggatttgtca ctgttcaacc ggcttcttca 

181 tttgagagct ttaccagaat caatgctact aagcatacac agagagatgt atccgaacgg
241 gtcgatgagg ttcttcaaaa aatcaatcca gttcttccca agaaaagcga cataaacgtg 
301 ggttccagag atgtgaatgc aacaagcggc actgattcta aaaaaagagg attaccagtg 
361 tccccaactg ttgttgccaa tccaagccct gcaaataaaa caaaatcgga agcctcatat 
421 acaggtgttc agaggaaaat agtaagtggt gatgaaactt ggagaacttg tgaagtgaaa 

481 tatgggagct actgcctctg gagggaggaa aataaggaac caatgaaaga tgccaaggtg
541 aagcaaatga aggaccagct gtttgtggct agagcatact atcccagtat tgctaaaatg 
601 ccttctcaaa gcaagttgac tcgggatatg aaacagaata tccaagagtt tgagcgtatt 
661 cttagtgaaa gttctcaaga tgctgacctt ccaccacagg ttgataaaaa gttgcagaag 
721 atggaagctg taattgcaaa ggcaaagtct tttccagtcg actgtaacaa tgttgacaag 

781 aaattgagac agatccttga tttgactgag gatgaagcta gtttccacat gaaacagagt
841 gtgttcctct accagcttgc agtacagaca atgcctaaga gtcttcattg cttgtcaatg 
901 cgactaactg tggaacattt caagtcagat tcacttgagg atcccattag tgagaaattt 
961 tcagatccct cattacttca ctttgttatc atctccgata atatactagc atcgtccgtt 
1021 gtgatcaact caacggttgt acatgcaagg gacagtaaaa actttgtttt ccatgtactg 

1081 acagacgagc agaattactt tgcaatgaaa caatggttta ttaggaatcc ttgcaaacaa
1141 tcaactgttc aagtattgaa cattgaaaaa ctcgagctgg acgattctga tatgaaactg 
1201 tctttgtctg cggagttccg tgtttccttc cccagtggtg accttttggc gtctcaacag 
1261 aatagaacac actacttatc ccttttctct caatctcact atcttcttcc caaattattt 
1321 gacaaattgg agaaggttgt gattctggat gatgacgttg tagtccagcg agacttatct 

1381 cccctttggg accttgatat ggaagggaaa gtgaatggcg ctgttaagtc gtgcactgtg
1441 agattgggtc agctaaggag tctcaagaga ggaaattttg ataccaatgc ttgtctctgg
1501 atgtctggtt tgaatgtcgt tgatcttgct agatggaggg cattgggtgt ttcagaaacc 
1561 tatcaaaaat attataaaga gatgagtagt ggagatgagt cgagcgaagc aattgcattg 
1621 caggcaagct tgctcacatt tcaagaccaa gtatatgctc ttgacgacaa atgggctcta 
1681 tcagggcttg gttatgacta ctacatcaat gcacaagcca taaaaaacgc agccatattg
1741 cactataacg ggaacatgaa gccgtggctt gagctgggaa tcccaaatta caaaaactat 
1801 tggagaaggc atctgagtcg ggaagatcgg ttcttgagtg actgtaacgt gaatccttga 



Amino Acid Sequence of Sequence #2: (SEQ ID NO:4)
GeneBank ID# NP_565893 
Positions 1-619 of NP_565893. 

1 mkgggggggg ggggkrrwkv lvigvlvlvi lsmlvplafl lglhngfhsp gfvtvqpass
61 fesftrinat khtqrdvser vdevlqkinp vlpkksdinv gsrdvnatsg tdskkrglpv
121 sptvvanpsp anktkseasy tgvqrkivsg detwrtcevk ygsyclwree nkepmkdakv
181 kqmkdqlfva rayypsiakm psqskltrdm kqniqeferi lsessqdadl ppqvdkklqk
241 meaviakaks fpvdcnnvdk klrqildlte deasfhmkqs vflyqlavqt mpkslhclsm
301 rltvehfksd sledpisekf sdpsllhfvi isdnilassv vinstvvhar dsknfvfhvl
361 tdeqnyfamk qwfirnpckq stvqvlniek lelddsdmkl slsaefrvsf psgdllasqq
421 nrthylslfs qshyllpklf dklekvvild ddvvvqrdls plwdldmegk vngavksctv
481 rlgqlrslkr gnfdtnaclw msglnvvdla rwralgvset yqkyykemss gdesseaial
541 qaslltfqdq vyalddkwal sglgydyyin aqaiknaail hyngnmkpwl elgipnykny
601 wrrhlsredr flsdcnvnp
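
The stated lengths of a coding sequence and its protein can also be cross-checked arithmetically: a CDS that ends in a stop codon, as the nucleotide sequences listed here do, should contain three nucleotides per residue plus three for the stop codon. The sketch below illustrates that check for Sequences #1 and #2; the names `STATED_LENGTHS` and `lengths_consistent` are hypothetical and are used only for illustration.

```python
# (CDS length in nucleotides, protein length in residues) as stated in the listing.
STATED_LENGTHS = {
    "Sequence #1": (2022, 673),
    "Sequence #2": (1860, 619),
}


def lengths_consistent(cds_nt, protein_aa):
    """True when the CDS holds exactly one codon per residue plus one stop codon."""
    return cds_nt == 3 * (protein_aa + 1)


for name, (cds_nt, protein_aa) in STATED_LENGTHS.items():
    print(name, lengths_consistent(cds_nt, protein_aa))
```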

Sequence #3 (SEQ ID NO:5) 
Gene name: At5g47780 

GeneBank accession # for reference: NM_124152 GI:30695292
Nucleotide sequence of Sequence #3:
Positions 1-1851 of CDS of NM_124152.

1 atgatggtga agcttcgcaa tcttgttctt ttcttcatgc tcctcaccgt cgttgctcat 
61 atccttctct acaccgatcc cgctgcctcc ttcaagaccc ccttttctaa acgcgatttc 

121 ctcgaggacg taaccgcctt gactttcaat tccgatgaga atcgtttgaa tcttcttcct
181 cgggaatctc ccgctgtgct cagaggagga ctcgtcggtg ctgtctattc cgataagaat 
241 tcacggcggc tagaccaatt gtctgctcga gttctttccg ccaccgacga tgatactcac 
301 tcacatactg acatttccat caaacaagtc actcatgatg cagcctcaga ctcgcatatt 
361 aatagggaaa atatgcatgt tcaattgacc caacaaacct ctgaaaaagt tgatgagcaa 

421 ccagagccta atgcttttgg agctaagaaa gatactggaa acgtgttgat gcctgatgct
481 caagtgaggc atcttaaaga tcagcttatt agggcaaagg tttatctttc ccttccatct 
541 gcaaaggcca atgctcattt tgtgagagag cttcgactcc gtattaaaga agttcaacgg 
601 gcacttgcag atgcctccaa ggattcggat ctgccaaaga ctgctataga aaagctaaaa 
661 gcaatggagc aaacactggc caaaggcaag cagatccaag atgactgttc tacagtggtc 

721 aagaagctac gtgctatgct ccactccgca gatgagcagc tacgggtcca taagaagcaa
781 accatgtttt tgactcaatt gactgctaag accattccta aaggacttca ctgccttcct 
841 ctgcgcctca ctacagacta ttatgcttta aattcatctg aacaacaatt tccaaatcag 
901 gagaaactag aagatactca gctgtatcac tatgcccttt tctctgataa tgttttggct 
961 acgtcagttg ttgttaactc taccataacc aatgcaaagc atcccttaaa gcatgtcttc 

1021 cacatcgtca cagacagact caattatgcg gcaatgagga tgtggttcct ggacaatcca
1081 cctggcaaag ccaccatcca ggttcagaat gttgaagaat ttacatggct gaattcaagc
1141 tacagtcccg ttctcaaaca gcttagttct agatcgatga tagattatta cttcagagcc 
1201 caccatacaa attcagacac caacttgaag ttccggaatc caaaatactt atcgatcctt 
1261 aatcatcttc gtttttactt gcctgagatc tttcccaagc tcagcaaagt gctcttcttg 
1321 gatgatgata tagttgtgca gaaggacctt tctggtcttt ggtcagttga tctgaaaggt 
1381 aatgttaacg gtgctgtaga gacgtgtggg gaaagctttc atcgctttga ccgttatctg 
1441 aacttctcaa atccactcat ttccaagaac tttgaccctc gagcttgtgg ttgggcgtat 
1501 ggtatgaatg tctttgatct ggatgaatgg aagaggcaaa acatcacaga agtttatcat 
1561 cgatggcagg atctgaatca agaccgagaa ttgtggaagc tagggacgtt gccgcctggt 
1621 ctaatcacat tttggagacg aacatatccg ctagaccgga aatggcacat actagggctt 
1681 ggatacaacc cgagtgtgaa ccaaagggat attgagaggg cagccgtgat acactataat 
1741 ggcaacctca aaccatggct agagattggg attccaagat acagaggctt ctggtcaaag 
1801 catgtagact atgagcacgt ttatctcaga gaatgcaaca tcaatcctta g 



Amino Acid Sequence of Sequence #3: (SEQ ID NO:6)
Genebank ID# NP_568688 
Positions 1-616 of NP_568688. 

1 mmvklrnlvl ffmlltvvah illytdpaas fktpfskrdf ledvtaltfn sdenrlnllp 
61 respavlrgg lvgavysdkn srrldqlsar vlsatdddth shtdisikqv thdaasdshi
121 nrenmhvqlt qqtsekvdeq pepnafgakk dtgnvlmpda qvrhlkdqli rakvylslps
181 akanahfvre lrlrikevqr aladaskdsd lpktaieklk ameqtlakgk qiqddcstvv
241 kklramlhsa deqlrvhkkq tmfltqltak tipkglhclp lrlttdyyal nsseqqfpnq
301 ekledtqlyh yalfsdnvla tsvvvnstit nakhplkhvf hivtdrlnya amrmwfldnp
361 pgkatiqvqn veeftwlnss yspvlkqlss rsmidyyfra hhtnsdtnlk frnpkylsil
421 nhlrfylpei fpklskvlfl dddivvqkdl sglwsvdlkg nvngavetcg esfhrfdryl
481 nfsnpliskn fdpracgway gmnvfdldew krqnitevyh rwqdlnqdre lwklgtlppg
541 litfwrrtyp ldrkwhilgl gynpsvnqrd ieraavihyn gnlkpwleig ipryrgfwsk
601 hvdyehvylr ecninp 



Sequence #4 (SEQ ID NO:7)
Gene name: At1g06780 

GeneBank accession # for reference: NM_100555 GI:30679825
Nucleotide sequence of Sequence #4:
Positions 1-1770 of CDS of NM_100555.

1 atgaaacaaa ttcgtcgatg gcagaggatt ttgatcctcg ctctgctatc gatatcagta 
61 ttcgctccgc ttattttcgt atcgaatcgg cttaagagca tcactcccgt tggtcgtaga 
121 gaatttattg aagagttatc caaaattaga ttcacgacaa atgaccttag acttagcgct 
181 attgaacatg aggatggaga aggcttgaag gggccaaggc tcattctctt caaggatggg 
241 gagtttaatt cgtctgctga aagtgatggt ggtaatactt acaaaaacag ggaagaacaa 
301 gtgattgttt cacagaagat gacagttagc tctgatgaaa agggtcaaat tctaccaaca 
361 gtcaaccaac ttgctaataa aacggatttc aagccccctt tatctaaggg tgaaaagaac 
421 acaagggttc agcccgacag agcaacagat gtgaaaacga aggagatcag agacaaaatt 
481 attcaagcta aagcctacct gaatttcgct ccacctggaa gtaactctca agttgtgaag 
541 gagttgagag gtcggctgaa agagctggaa cggtctgttg gtgatgcaac aaaggacaag 
601 gacttatcaa agggcgctct ccgcagggtg aagcccatgg aaaatgtgtt atataaggct 
661 agtcgtgtct ttaacaattg ccctgccatc gctaccaaac tccgtgccat gaattataac
721 acagaagaac aagttcaggc gcagaaaaat caagcagcgt atctaatgca gcttgcagca 
781 aggaccaccc caaaagggct tcactgtctc tcaatgcggc tgacatcaga atacttttca 
841 ctggatcctg aaaaaaggca gatgcctaac cagcaaaatt attttgacgc taatttcaat 
901 cattatgttg tcttctctga caatgttttg gcttcttcag tcgttgttaa ctctacgata 
961 tcttcatcaa aggagccaga aagaatagtc ttccatgtcg tgactgattc acttaattac 
1021 ccagcaatct caatgtggtt tctgctaaac attcaaagta aagctactat ccaaatccta 
1081 aacattgatg atatggatgt cctgcctaga gattatgatc aattactgat gaagcaaaac 
1141 tctaatgacc caagattcat ttctacactc aatcacgcac gcttctatct cccggatata
1201 ttcccgggtt tgaacaagat ggtactcttg gaccatgatg tagttgttca aagagattta 
1261 agtagactgt ggagcattga tatgaaagga aaggtggttg gagctgtaga gacttgtctt 
1321 gaaggtgaat cttcatttcg atcaatgagc acatttatta atttctcaga cacatgggtc 
1381 gctgggaaat ttagtcctag agcttgcaca tgggctttcg ggatgaatct aattgatctc 
1441 gaagaatgga gaatacggaa gttgacttct acatacataa aatacttcaa cctgggaaca 
1501 aagagaccat tgtggaaagc tgggagctta ccaataggtt ggttgacttt ctataggcaa 
1561 acattagcat tggacaagag atggcatgtg atggggttag gtcgcgaatc aggagtcaaa 
1621 gcggttgaca tcgaacaagc ggcagttata cactacgatg gggtcatgaa gccgtggttg 
1681 gacattggaa aagagaatta caaacgttac tggaacatac acgtccctta ccatcacacc 
1741 tacttgcaac agtgcaatct tcaagcttga



Amino Acid Sequence of Sequence #4: (SEQ ID NO: 8) 
Genebank ID# NP_563771 
Positions 1-589 of NP_563771.



1 mkqirrwqri lilallsisv faplifvsnr lksitpvgrr efieelskir fttndlrlsa
61 iehedgeglk gprlilfkdg efnssaesdg gntyknreeq vivsqkmtvs sdekgqilpt
121 vnqlanktdf kpplskgekn trvqpdratd vktkeirdki iqakaylnfa ppgsnsqvvk
181 elrgrlkele rsvgdatkdk dlskgalrrv kpmenvlyka srvfnncpai atklramnyn
241 teeqvqaqkn qaaylmqlaa rttpkglhcl smrltseyfs ldpekrqmpn qqnyfdanfn
301 hyvvfsdnvl assvvvnsti ssskeperiv fhvvtdslny paismwflln iqskatiqil
361 niddmdvlpr dydqllmkqn sndprfistl nharfylpdi fpglnkmvll dhdvvvqrdl
421 srlwsidmkg kvvgavetcl egessfrsms tfinfsdtwv agkfspract wafgmnlidl
481 eewrirklts tyikyfnlgt krplwkagsl pigwltfyrq tlaldkrwhv mglgresgvk 
541 avdieqaavi hydgvmkpwl digkenykry wnihvpyhht ylqqcnlqa 

Sequence #5 (SEQ ID NO:9)
Gene name: At1g18580

GeneBank accession # for reference: AY062444 GI:17064735
Nucleotide sequence of Sequence #5: 
Positions 1-1614 of CDS of AY062444. 



1 atgaggcggt ggccggtgga tcaccggcgg cgaggtagaa ggagattgtc gagttggata 
61 tggtttctcc ttggttcttt ctctgtcgct ggtttagttc tcttcatcgt tcagcattat 
121 caccatcaac aagatccatc ccagctttta cttgagagag acacgagaac cgaaatggta 
181 tctcctcccc atttaaactt cacggaagag gtcacaagtg cttcctcctt ctctaggcag 
241 ttagcagagc aaatgacact tgccaaagct tatgtgttta tagctaaaga gcataataat 
301 cttcatttag cttgggaatt gagttctaag atcagaagtt gtcagctttt gctttccaaa 
361 gcagctatga gaggacaacc tatttcgttt gatgaggcta aaccgattat tactggtcta 
421 tcagctctta tctacaaggc tcaagatgca cattatgata ttgccaccac tatgatgacc 
481 atgaaatctc acatccaagc acttgaagag cgtgcaaatg cagctactgt tcagaccaca 
541 atatttgggc aattggttgc tgaggcatta ccaaagagcc tccactgttt gacgataaag 
601 ctcacatctg attgggtaac agagccatct cgccatgaac tggcagatga gaacagaaac 
661 tcacctagac ttgtcgacaa caacctctac cacttctgca tcttctcgga caacgtgatt 
721 gccacctcgg ttgttgttaa ttcaactgtc tcgaatgctg atcatccaaa gcagcttgtt 
781 ttccacatag tgacgaatcg agtgagctac aaagctatgc aggcctggtt tctaagtaat 
841 gacttcaagg gctcagcaat agagatcagg agcgtagagg agttttcttg gttgaatgct 
901 tcatattctc ctgttgttaa gcaactgctg gacacagatg caagagctta ctatttcggg 
961 gaacagacaa gtcaagatac gatttccgac ccaaaagtga ggaacccaaa gtacttgtca 
1021 ttactgaacc atctcagatt ctacattccg gagatctatc cacagctaga gaagattgtt 
1081 ttcctagacg atgatgttgt tgttcagaaa gatttgactc cactcttctc cttggatctg 
1141 catggaaacg tcaatggagc tgtggaaaca tgtcttgaag cctttcaccg atattacaag 
1201 tatctaaatt tctcgaaccc actcatcagc tcaaagttcg acccacaagc atgtggatgg 
1261 gcttttggta tgaacgtttt tgatctgatc gcttggagga atgcaaacgt gactgctcgg 
1321 taccattact ggcaagatca gaacagagaa cgaacgcttt ggaaactcgg gacactccct 
1381 ccaggtctac tatctttcta tggtctcaca gagccactgg acagaagatg gcatgtcttg 
1441 ggtttaggtt acgatgtgaa catcgataac cgtctgatcg aaacagcagc tgtgattcac 
1501 tataatggta acatgaagcc ttggctaaag ctggctattg gtaggtataa acctttctgg 
1561 ttaaagtttt tgaactcgag ccatccttat ttacaagatt gtgtcacagc ttaa 

Amino Acid Sequence of Sequence #5: (SEQ ID NO: 10) 
Genebank ID# AAK93644 GI:15293067
Positions 1-537 of AAK93644. 



1 mrrwpvdhrr rgrrrlsswi wfllgsfsva glvlfivqhy hhqqdpsqll lerdtrtemv 
61 spphlnftee vtsassfsrq laeqmtlaka yvfiakehnn lhlawelssk irscqlllsk
121 aamrgqpisf deakpiitgl saliykaqda hydiattmmt mkshiqalee ranaatvqtt
181 ifgqlvaeal pkslhcltik ltsdwvteps rheladenrn sprlvdnnly hfcifsdnvi
241 atsvvvnstv snadhpkqlv fhivtnrvsy kamqawflsn dfkgsaieir sveefswlna
301 syspvvkqll dtdarayyfg eqtsqdtise pkvrnpkyls llnhlrfyip eiypqlekiv
361 fldddvvvqk dltplfsldl hgnvngavet cleafhryyk ylnfsnplis skfdpqacgw
421 afgmnvfdli awrnanvtar yhywqdqnre rtlwklgtlp pgllsfyglt epldrrwhvl
481 glgydvnidn rlietaavih yngnmkpwlk laigrykpfw lkflnsshpy lqdcvta

Sequence #6 (SEQ ID NO: 11)
Gene name: At2g20810 

GeneBank accession # for reference: NM_127647 GI:30681142
Nucleotide sequence of Sequence #6:
Positions 1-1611 of CDS of NM_127647.

1 atgagaagga gaggagggga tagtttccgg agagctggac ggaggaagat ctcgaatgtg 
61 gtatggtggg ttctctctgg tattgccctc ctgctcttct ttctcattct ctccaaagct 
121 ggtcatattg aacctagacc ctctattcct aagcgacgtt accgtaatga caaatttgta 
181 gagggtatga atatgactga ggaaatgttg agtcctactt ccgttgctcg tcaagttaat 
241 gatcagattg ctcttgctaa agcttttgtt gtcattgcta aagaaagtaa gaatcttcag 
301 tttgcttggg acttaagtgc tcagatccgt aactctcagt tgcttttatc gagtgctgct 
361 actaggagaa gtcccttgac tgtcttggaa tctgagtcta ctattcgtga catggctgtt 
421 ttgttatatc aagctcagca gcttcactat gatagtgcta ctatgattat gaggcttaag 
481 gcctcgattc aggctcttga agaacaaatg agttccgtta gcgagaagag ttccaagtat 
541 ggacagattg ctgctgagga agtgcctaag agtctttact gtcttggtgt tcgtctcact 
601 accgaatggt ttcagaattt agacttacag agaactctta aggaaaggag tcgtgttgat 
661 tcgaaactca cggataacag tctctaccat ttctgtgtgt tttccgataa cattattgct 
721 acttctgttg tggttaattc tactgctctc aattccaagg cccctgagaa agttgtgttt 
781 catcttgtga ctaatgagat caactatgct gcaatgaagg cttggttcgc cattaatatg 
841 gacaacctca gaggagtcac tgtggaggtt cagaagttcg aggatttctc atggctgaat 
901 gcttcctatg ttccggtcct caagcagctg caagactctg atacgcaaag ctattatttc 
961 tctggacaca acgatgatgg gcgcactcca atcaaattca ggaaccccaa gtatctttcc 
1021 atgctcaacc atcttaggtt ctacatccct gaagtgtttc ctgcgctgaa gaaggtggtc 
1081 tttcttgatg atgatgttgt agttcagaag gatctttcat ctctcttttc gatcgattta 
1141 aacaaaaatg tgaacggggc tgttgagacc tgcatggaga ccttccaccg ctaccacaag
1201 tacttgaact attctcatcc tctcatacgc tcccactttg atccagatgc gtgtgggtgg 
1261 gcgtttggaa tgaacgtctt tgatttagtt gagtggagga agagaaatgt gaccggcata 
1321 taccactact ggcaagaaaa aaacgtggac cggaccttat ggaaactggg aacactacct 
1381 ccaggacttc tgacatttta cgggttaaca gaggcactag aggcgtcctg gcatatcctg 
1441 ggattgggat acacgaatgt ggatgctcgt gtgatagaga aaggagctgt tcttcacttc 
1501 aatgggaact taaagccatg gttgaagatc gggatagaga agtacaaacc tttgtgggag 
1561 agatacgttg attacacttc tccttttatg caacaatgca attttcattg a 

Amino Acid Sequence of Sequence #6: (SEQ ID NO: 12)
Genebank ID# NP_565485 
Positions 1-536 of NP_565485. 

1 mrrrggdsfr ragrrkisnv vwwvlsgial llfflilska ghieprpsip krryrndkfv
61 egmnmteeml sptsvarqvn dqialakafv viakesknlq fawdlsaqir nsqlllssaa
121 trrspltvle sestirdmav llyqaqqlhy dsatmimrlk asiqaleeqm ssvsekssky
181 gqiaaeevpk slyclgvrlt tewfqnldlq rtlkersrvd skltdnslyh fcvfsdniia
241 tsvvvnstal nskapekvvf hlvtneinya amkawfainm dnlrgvtvev qkfedfswln
301 asyvpvlkql qdsdtqsyyf sghnddgrtp ikfrnpkyls mlnhlrfyip evfpalkkvv
361 fldddvvvqk dlsslfsidl nknvngavet cmetfhryhk ylnyshplir shfdpdacgw
421 afgmnvfdlv ewrkrnvtgi yhywqeknvd rtlwklgtlp pglltfyglt ealeaswhil 
481 glgytnvdar viekgavlhf ngnlkpwlki giekykplwe ryvdytspfm qqcnfh 






Sequence #7 (SEQ ID NO: 13) 
Gene name: At2g30575 

GeneBank accession # for reference: NM_179819 GI:30684641
Nucleotide sequence of Sequence #7:
Positions 1-1833 of NM_179819.

1 atgaatcaag ttcgtcgttg gcagaggatt ctgatcctct cgctgctatt gttatctgtt 
61 ttagctccga ttgttttcgt ttcgaatcgg ctcaagagca tcacttccgt cgatagagga 

121 gaattcattg aagaattatc cgacattaca gataagaccg aggatgaact tagacttact
181 gctattgaac aggacgaaga aggcttgaag gagcctaaac gtattctgca ggatcgagat 
241 tttaattctg tggttttgtc aaattcctct gataaaagta atgatactgt gcagtctaat 
301 gagggagacc aaaaaaactt tctctcagaa gttgataagg gaaataatca caaaccaaag 
361 gaggaacaag cagtttcaca gaaaaccaca gtaagctcga atgcggaggt gaaaatttca 

421 gcaagagata ttcaacttaa tcataaaacg gaattccgac ccccttcaag taagagtgaa
481 aagaatacaa gggttcaact tgaaagagca acagatgaga gggtaaagga gatcagagac 
541 aaaattatcc aagcgaaagc ctatctgaat ttggccctac ctgggaataa ctcccaaatc 
601 gtaaaggagt tgagagttcg aacgaaagag ctggaacggg ctactggtga tactaccaag 
661 gataaatatt tgccaaagag ctctcctaac agattgaagg ccatggaagt tgcgttatac 

721 aaggtcagcc gtgcctttca caactgccct gccattgcta ccaaactcca agccatgact
781 tataaaaccg aagaacaagc tcgggcgcag aagaaacaag cagcatattt aatgcagctt 
841 gcagcaagga ctaccccaaa agggcttcat tgtctctcaa tgcggttgac aacagaatat 
901 tttaccctgg atcacgaaaa aaggcagctt ttgcaacaaa gttataatga tcctgatctc 
961 taccattacg tagtcttctc tgacaatgtt ttggcctctt cggttgttgt taactctaca 

1021 atctcctcat caaaggaacc ggataaaata gtattccatg tggtgacaga ttcactcaat
1081 tacccagcaa tctcaatgtg gtttttacta aacccaagtg gcagagcttc aatccaaatc 
1141 ctaaacattg atgaaatgaa tgtcctgcca ttgtaccatg ctgaattgct gatgaagcaa 
1201 aattcaagtg acccaagaat catttcagcg ctcaaccatg cacgcttcta tctcccagat 
1261 atcttcccag gtctaaacaa gatcgtactc ttcgatcatg atgtagtagt gcaaagggat 

1321 ctaactagac tgtggagcct tgatatgacg gggaaagttg ttggagctgt agagacttgt
1381 cttgaaggtg atccttcata tcgttcgatg gactcattca ttaatttctc agatgcatgg 
1441 gtttctcaga aatttgatcc caaggcttgc acttgggcat tcgggatgaa tctatttgat 
1501 ctcgaagaat ggagaagaca ggagttgact tctgtatacc tgaaatactt cgacctggga 
1561 gtaaaaggac atctgtggaa agcaggggga ttgccagtag gttggttgac ttttttcggg 

1621 caaacgtttc cgttggaaaa gagatggaac gtgggtgggt taggtcacga atcaggactc
1681 agggcaagcg acatcgaaca agcagcggtt atacactacg acgggatcat gaaaccatgg 
1741 ctggacatcg gtatagacaa gtacaagcgc tactggaaca tacatgtacc ttaccatcac 
1801 cctcacttac aacggtgcaa cattcacgat tga 

Amino Acid Sequence of Sequence #7: (SEQ ID NO: 14) 
Genebank ID# NP_850150 
Positions 1-610 of NP_850150. 

1 mnqvrrwqri lilsllllsv lapivfvsnr lksitsvdrg efieelsdit dktedelrlt
61 aieqdeeglk epkrilqdrd fnsvvlsnss dksndtvqsn egdqknflse vdkgnnhkpk
121 eeqavsqktt vssnaevkis ardiqlnhkt efrppsskse kntrvqlera tdervkeird
181 kiiqakayln lalpgnnsqi vkelrvrtke leratgdttk dkylpksspn rlkamevaly
241 kvsrafhncp aiatklqamt ykteeqaraq kkqaaylmql aarttpkglh clsmrlttey
301 ftldhekrql lqqsyndpdl yhyvvfsdnv lassvvvnst issskepdki vfhvvtdsln
361 ypaismwfll npsgrasiqi lnidemnvlp lyhaellmkq nssdpriisa lnharfylpd
421 ifpglnkivl fdhdvvvqrd ltrlwsldmt gkvvgavetc legdpsyrsm dsfinfsdaw
481 vsqkfdpkac twafgmnlfd leewrrqelt svylkyfdlg vkghlwkagg lpvgwltffg
541 qtfplekrwn vgglghesgl rasdieqaav ihydgimkpw ldigidkykr ywnihvpyhh
601 phlqrcnihd

Sequence #8 (SEQ ID NO: 15) 

Gene name: At2g46480 
GeneBank accession # for reference: NM_130212 GI:22326493
Nucleotide sequence of Sequence #8: 
Positions 1-1587 of NM_130212. 

1 atgactgatg cttgttgttt gaagggaaac gaggacaaaa tggttcctcg ttttggtcat 

61 ggaacctgga taggaaaagc atttaatgat acaccagaga tgttgcatga aaggagtctg
121 agacaggaaa aaagattgga aagggctaat gagctgatga atgatgatag tctgcaaaag 
181 cttgagacgg cagccatggc acgttccaga tctgtcgatt ctgcaccact aggaaactac 
241 accatttgga aaaatgaata ccggaggggc aagagttttg aagatatgtt acgtttgatg 
301 caagatcaaa tcatcatggc acgagtttac agtggacttg caaagtttac aaacaatctc 

361 gccttgcacc aagagataga aacacaacta atgaaactag cttgggagga agaatctact
421 gatattgatc aggagcagag agtacttgac agtataagag acatgggaca aatactggct 
481 agagcacacg agcagctata tgaatgcaag ttggtgacaa ataagttgag agcaatgcta 
541 caaacagttg aagatgaact cgaaaacgag cagacttata taacgttctt gactcagcta 
601 gcttccaagg cactaccaga tgctatccac tgcttgacca tgcgcttgaa tctagagtat 

661 catctcctgc ctttaccgat gagaaatttt ccaaggaggg agaatttgga gaatccaaaa
721 ctttaccact acgctctctt ctctgataat gtactggctg catcagttgt tgtcaactcc 
781 acagtcatga atgcacagga tccttcaagg catgttttcc accttgtgac tgataagctc 
841 aactttggag caatgagtat gtggtttctg ttgaaccctc ctggagaagc gaccatccat 
901 gtccaaaggt ttgaagattt tacttggctc aactcatctt actctccagt tttgagtcag 

961 ctcgagtcag cagctatgaa gaagttctac ttcaagacag cgaggtctga atcagttgaa
1021 tcaggctcag aaaacctcaa gtaccggtac ccgaaataca tgtcaatgct taaccacctg 
1081 aggttctaca tccctaggat cttcccaaag ttggagaaaa tcttgtttgt tgacgatgat 
1141 gtggttgttc agaaggattt aactccccta tggtccattg atcttaaagg gaaagtgaat
1201 gaaaactttg atcccaagtt ctgcggatgg gcttatggga tgaacatctt cgacctgaaa 

1261 gaatggaaga agaacaacat tacagaaact tatcactttt ggcaaaacct gaacgaaaac
1321 cggactctat ggaaactagg aacattgcca ccagggctca taacgttcta caatctgaca 
1381 caaccacttc agagaaaatg gcacttactt ggactgggtt atgataaagg aatcgatgtc 
1441 aagaagattg aaagatcagc tgttatacat tacaatggac acatgaaacc atggacagag 
1501 atggggataa gcaagtatca gccatattgg acgaagtaca ccaattttga ccatccttac 

1561 atctttactt gcaggctgtt tgagtga

Amino Acid Sequence of Sequence #8: (SEQ ID NO: 16) 
Genebank ID# NP_182171
Positions 1-528 of NP_182171.

1 mtdacclkgn edkmvprfgh gtwigkafnd tpemlhersl rqekrleran elmnddslqk 
61 letaamarsr svdsaplgny tiwkneyrrg ksfedmlrlm qdqiimarvy sglakftnnl 
121 alhqeietql mklaweeest didqeqrvld sirdmgqila raheqlyeck lvtnklraml
181 qtvedelene qtyitfltql askalpdaih cltmrlnley hllplpmrnf prrenlenpk
241 lyhyalfsdn vlaasvvvns tvmnaqdpsr hvfhlvtdkl nfgamsmwfl lnppgeatih
301 vqrfedftwl nssyspvlsq lesaamkkfy fktarsesve sgsenlkyry pkymsmlnhl
361 rfyiprifpk lekilfvddd vvvqkdltpl wsidlkgkvn enfdpkfcgw aygmnifdlk
421 ewkknnitet yhfwqnlnen rtlwklgtlp pglitfynlt qplqrkwhll glgydkgidv
481 kkiersavih ynghmkpwte mgiskyqpyw tkytnfdhpy iftcrlfe

Sequence #9 (SEQ ID NO: 17)

Gene name: At3g01040 
GeneBank accession # for reference: NM_110969 GI:30678269
Nucleotide sequence of Sequence #9:
Positions 1-1602 of CDS of NM_110969.



1 atgcagcttc acatatcgcc tagcatgaga agcattacga tatcgagcag caatgagttt
61 attgatttga tgaagatcaa agtcgcagct cgtcacatct cttaccgaac tctcttccac 
121 actatcttaa tcctcgcttt cttgttacct tttgttttca tcctaaccgc tgttgttacc 
181 cttgaaggtg tcaacaagtg ctcctctttt gattgtttcg ggaggcggct aggaccacgt 
241 cttcttggta ggatagatga ttcagagcag agactagtta gagattttta caaaattcta 

301 aatgaagtaa gcactcaaga aattccagat ggtttaaagc ttccagagtc ttttagtcaa
361 ctggtttcgg atatgaagaa caaccactat gatgctaaaa catttgccct cgtatttcga 
421 gctatggtag agaagtttga aagggattta agggaatcca aatttgcaga actcatgaac 
481 aagcactttg ctgcaagttc aattccaaaa ggaattcact gtctctcttt aagactaacc 
541 gatgaatatt cctccaatgc tcatgcccgg agacagcttc cttccccgga gcttctccct 

601 gttctctcag acaatgctta ccaccatttt gttctagcta cagataatat cttagctgca
661 tcggttgtgg tctcatctgc tgttcaatca tcttcaaaac ccgagaaaat tgtcttccat 
721 gttatcacag acaagaaaac ctatgcgggt atgcattctt ggtttgcact caattctgtt 
781 gctcctgcga ttgttgaagt gaaaagcgtt catcagtttg attggttaac aagagagaat 
841 gttccagttc ttgaagctgt ggaaagccat aacagtatca gaaattatta ccatgggaat 

901 catattgctg gtgcaaacct cagcgaaaca acccctcgaa catttgcttc gaaactgcag
961 tcaagaagtc ccaaatacat atctttgctc aaccatctta gaatatatct accagagctt 
1021 tttccgaact tagacaaggt agtgttctta gatgatgata tagtgataca gaaagattta 
1081 tctccgcttt gggatattga ccttaacggg aaggttaatg gagctgtgga gacttgtcga 
1141 ggagaagacg tatgggttat gtcaaagcgt cttaggaact acttcaattt ttctcacccg
1201 ctcatcgcaa agcatttaga tcccgaagaa tgtgcttggg cttatggaat gaatatcttt
1261 gatctacgga cttggaggaa gacaaatatc agagaaacgt atcattcttg gcttaaagag 
1321 aatctgaagt cgaatctaac aatgtggaaa cttggaacat tgcctcctgc tctaatagca 
1381 tttaaaggtc atgttcagcc aatagattcc tcttggcata tgcttggatt aggttatcag 
1441 agcaagacca acttagaaaa tgcgaagaaa gctgcagtga ttcattacaa tggccaatca 

1501 aagccgtggc ttgagatagg tttcgagcat ctcagaccat tctggacaaa atatgttaac
1561 tactccaatg atttcattaa gaattgtcat atcttggaat ag 



Amino Acid Sequence of Sequence #9: (SEQ ID NO: 18) 
Genebank ID# NP_186753
Positions 1-533 of NP_186753.

1 mqlhispsmr sitisssnef idlmkikvaa rhisyrtlfh tililafllp fvfiltavvt
61 legvnkcssf dcfgrrlgpr llgriddseq rlvrdfykil nevstqeipd glklpesfsq
121 lvsdmknnhy daktfalvfr amvekferdl reskfaelmn khfaassipk gihclslrlt
181 deyssnahar rqlpspellp vlsdnayhhf vlatdnilaa svvvssavqs sskpekivfh
241 vitdkktyag mhswfalnsv apaivevksv hqfdwltren vpvleavesh nsirnyyhgn
301 hiaganlset tprtfasklq srspkyisll nhlriylpel fpnldkvvfl dddiviqkdl
361 splwdidlng kvngavetcr gedvwvmskr lrnyfnfshp liakhldpee cawaygmnif
421 dlrtwrktni retyhswlke nlksnltmwk lgtlppalia fkghvqpids swhmlglgyq
481 sktnlenakk aavihyngqs kpwleigfeh lrpfwtkyvn ysndfiknch ile

Sequence #10 (SEQ ID NO: 19)

Gene name: At3g02350 
GeneBank accession # for reference: NM_111102 GI:18396158
Nucleotide sequence of Sequence #10:
Positions 1-1686 of CDS of NM_111102.

1 atggcggtgg ccttccgtgg aggccgggga ggcgtcggat ccggccaatc taccggactt 

61 cgtagtttct tctcctaccg gatctttatc tccgctttgt tctcttttct cttcctcgcc
121 actttctccg tcgttcttaa ctcctctcgt catcagcctc atcaggatca tacattgccg 
181 agtatgggca acgcatatat gcagaggacg tttttggctt tgcaatcgga tccattgaaa 
241 actaggttgg atctgataca caagcaagcc attgatcatt tgacactggt gaatgcgtat 
301 gctgcttacg ctaggaagct aaagcttgat gcttctaagc agcttaagct cttcgaagat 

361 ttggctatca acttctcgga tttgcagtcg aaacctggtt tgaaatctgc tgtgtctgat
421 aatggtaatg ctcttgagga ggattcgttt aggcagcttg agaaagaagt gaaggataag 
481 gtgaagacag cgaggatgat gatcgttgag tctaaagaga gttatgatac acagcttaaa 
541 atccagaagt tgaaagatac aatctttgct gtccaagaac agttgacaaa ggctaagaaa 
601 aacggtgcgg ttgctagctt gatttcagcc aagtcggttc ctaaaagtct tcattgtttg 

661 gccatgaggc ttgtaggaga gaggatctct aatcctgaga agtacaagga tgctccacct
721 gacccagccg cagaggatcc aactctttac cactatgcga ttttctctga taatgtcatt 
781 gctgtgtctg ttgtggtgag atcggttgtg atgaacgctg aggagccatg gaagcatgtc 
841 ttccatgtgg tgacagatcg gatgaatctc gcagccatga aggtgtggtt taagatgcgt 
901 cctttggacc gtggtgccca tgttgagatt aaatccgtgg aggatttcaa gttcttaaac 

961 tcttcctatg cgccggtctt gaggcagctt gagtctgcca agttgcagaa gttttacttt
1021 gagaatcaag ctgagaacgc aactaaagat tcacataacc tcaagttcaa gaaccccaag 
1081 tatctctcga tgttgaacca tctcagattt tacttaccag agatgtatcc gaagctgaat 
1141 aagattttgt tcttggacga tgatgttgtg gtgcagaaag acgtgactgg tttatggaaa 
1201 atcaacttgg atggcaaggt gaatggagcc gttgagacat gttttggttc ttttcatcga 

1261 tatggtcaat acttaaactt ctctcatcct ttgatcaaag agaactttaa ccccagtgcc
1321 tgtgcttggg cctttggaat gaacatattc gatctcaatg cctggagacg cgagaagtgc 
1381 accgatcaat accattactg gcagaacctg aatgaagaca gaactctctg gaaattggga 
1441 actctacctc cgggattgat cacattctat tcaaagacga aatcattgga caaatcatgg 
1501 catgtacttg ggttaggcta taacccggga gtgagcatgg acgaaatcag aaatgcagga
1561 gtgattcatt acaatggaaa catgaaaccg tggctagaca ttgcgatgaa ccaatacaag
1621 tctctctgga ctaaatatgt tgataacgaa atggagtttg tgcagatgtg caattttggt 
1681 ctctaa 

Amino Acid Sequence of Sequence #10: (SEQ ID NO: 20)
Genebank ID# NP_566170.1 
Positions 1-561 of NP_566170. 

1 mavafrggrg gvgsgqstgl rsffsyrifi salfsflfla tfsvvlnssr hqphqdhtlp
61 smgnaymqrt flalqsdplk trldlihkqa idhltlvnay aayarklkld askqlklfed
121 lainfsdlqs kpglksavsd ngnaleedsf rqlekevkdk vktarmmive skesydtqlk
181 iqklkdtifa vqeqltkakk ngavaslisa ksvpkslhcl amrlvgeris npekykdapp
241 dpaaedptly hyaifsdnvi avsvvvrsvv mnaeepwkhv fhvvtdrmnl aamkvwfkmr
301 pldrgahvei ksvedfkfln ssyapvlrql esaklqkfyf enqaenatkd shnlkfknpk
361 ylsmlnhlrf ylpemypkln kilfldddvv vqkdvtglwk inldgkvnga vetcfgsfhr
421 ygqylnfshp likenfnpsa cawafgmnif dlnawrrekc tdqyhywqnl nedrtlwklg
481 tlppglitfy sktksldksw hvlglgynpg vsmdeirnag vihyngnmkp wldiamnqyk
541 slwtkyvdne mefvqmcnfg l


Sequence #11 (SEQ ID NO: 21)
Gene name: At3g25140

GeneBank accession # for reference: NM_113418 GI:30687767
Nucleotide sequence of Sequence #11:
Positions 1-1680 of CDS of NM_113418.

1 atggctaatc accaccgact tttacgcggc ggcggatctc cggccataat cggtggcaga 
61 atcacactca cagctttcgc ttccactatc gcactcttcc tcttcactct ctccttcttc 
121 ttcgcttcag attctaacga ttctcctgat ctccttcttc ccggtgttga gtactctaat
181 ggagtcggat ctagaagatc catgttggat atcaaatcgg atccgcttaa gccacggttg 
241 attcagatcc ggaaacaagc tgatgatcat cggtcattag cattagctta tgcttcttac 
301 gcgagaaagc ttaagctcga gaattcgaaa ctcgtcagga tcttcgctga tctttcgagg 
361 aattacacgg atctgattaa caaaccgacg tatcgagctt tgtatgattc tgatggagcc 
421 tcgattgaag aatctgtgct taggcaattt gagaaagaag ttaaggaacg gattaaaatg
481 actcgtcaag tgattgctga agctaaagag tcttttgata atcagttgaa gattcagaag 
541 ctgaaagata cgattttcgc tgttaacgaa cagttaacta atgctaagaa gcaaggtgcg 
601 ttttcgagtt tgatcgctgc gaaatcgatt ccgaaaggat tgcattgtct tgctatgagg
661 ctgatggaag agaggattgc tcaccctgag aagtatactg atgaagggaa agatagaccg 
721 cgggagctcg aggatccgaa tctttaccat tacgctatat tttcggataa tgtgattgcg
781 gcttcggtgg ttgtgaactc tgctgtgaag aatgctaagg agccgtggaa gcatgttttt 
841 cacgttgtga ctgataagat gaatcttgga gctatgcagg ttatgtttaa actgaaggag 
901 tataaaggag ctcatgtaga agttaaagct gttgaggatt atacgttttt gaactcttcg 
961 tatgtgcctg tgttgaagca gttagaatct gcgaatcttc agaagtttta tttcgagaat 
1021 aagctcgaga atgcgacgaa agataccacg aatatgaagt tcaggaaccc caagtattta
1081 tctatattga atcacttgag gttttattta cccgagatgt acccgaaact acataggata 
1141 ctgtttttgg acgatgatgt ggttgtgcag aaggatttaa cgggtctgtg ggagattgat 
1201 atggatggga aagtgaatgg agctgtagag acttgttttg ggtcgtttca tcggtacgct 
1261 caatacatga atttctcaca tcctttgatc aaagagaagt ttaatcccaa agcatgtgcg 
1321 tgggcgtatg gaatgaactt ctttgatctt gatgcttgga gaagagagaa gtgcacagaa
1381 gaatatcact actggcaaaa tctgaacgag aacagggctc tatggaaact ggggacgtta 
1441 ccaccgggac tgatcacctt ttactcaacc acaaagccgc tggacaaatc atggcatgtg 
1501 cttgggctgg gttacaatcc gagcattagc atggatgaga tccgcaacgc tgcagtggta 
1561 cacttcaacg gtaacatgaa gccatggctt gacatagcta tgaaccagtt tcgaccactt 
1621 tggaccaaac acgtcgacta tgacctcgag tttgttcagg cttgcaattt tggcctctga

Amino Acid Sequence of Sequence #11: (SEQ ID NO: 22)
Genebank ID# NP_189150 
Positions 1-559 of NP_189150. 

5 1 manhhrllrg ggspaiiggr itltafasti alflftlsff fasdsndspd lllpgveysn 

61 gvgsrrsmld iksdplkprl iqirkqaddh rslalayasy arklklensk Ivrifadlsr 
121 nytdlinkpt yralydsdga sieesvlrqf ekevkerikm trqviaeake sfdnqlkiqk 
181 lkdtifavne qltnakkqga fssliaaksi pkglhclamr lmeeriahpe kytdegkdrp
241 reledpnlyh yaifsdnvia asvvvnsavk nakepwkhvf hvvtdkmnlg amqvmfklke
301 ykgahvevka vedytflnss yvpvlkqles anlqkfyfen klenatkdtt nmkfrnpkyl
361 silnhlrfyl pemypklhri lfldddvvvq kdltglweid mdgkvngave tcfgsfhrya
421 qymnfshpli kekfnpkaca waygmnffdl dawrrekcte eyhywqnlne nralwklgtl 
481 ppglitfyst tkpldkswhv lglgynpsis mdeirnaavv hfngnmkpwl diamnqfrpl
541 wtkhvdydle fvqacnfgl 


Sequence #12 (SEQ ID NO: 23) 
Gene name: At3g58790 

GeneBank accession # for reference: NM_115741 GI:22331856
Nucleotide sequence of Sequence #12:
Positions 1-1623 of CDS of NM_115741.

1 atgaagtttt acatatcagc gacggggatt aagaaggtta cgatatcaaa tcccggcgtc 
61 ggaatcggta aaggaagcgg aggatgtgcg gctgcagcgg cggcgttagc agcgcggaga 

121 ttctctagtc gcacgttgtt actgttgctg ctgctgctcg ctatcgtcct cccttttatc

181 ttcgtcaggt tcgcgtttct cgtcctcgaa tctgcctccg tttgcgattc accactcgat 
241 tgcatgggac tcagactttt ccgtgggggc gacacatctc tgaaaattgg ggaagagttg 
301 acacgggctc tagtggaaga gacgacagat catcaggacg ttaatggaag aggaacgaag 
361 ggatcattgg agtcattcga cgaccttgtt aaggagatga cgttaaaacg ccgtgacata 

421 agggcgtttg cttccgtgac taagaagatg ctgttgcaga tggaacgtaa agtccaatca
481 gcgaaacatc atgagttagt gtactggcat ttagcctctc acggtattcc taaaagcctc 
541 cattgccttt ccctcagatt aactgaagag tactctgtaa atgcaatggc tcgaatgcgt 
601 ttgcctccgc ctgagtccgt atcacgtctg accgacccat cttttcatca tattgtcctc 
661 ctgactgaca atgtccttgc tgcctctgtc gtcatatcgt ctactgtaca aaacgctgtg 

721 aatcccgaga agtttgtctt tcatattgtt accgataaga aaacctatac ccctatgcat
781 gcttggtttg ctatcaactc tgcttcatca ccagttgttg aagtaaaggg acttcatcag 
841 tatgattggc ctcaagaagt gaacttcaaa gttagagaga tgctggacat tcaccgctta 
901 atttggagac gacattatca aaatttgaaa gactctgatt ttagttttgt tgagggtact 
961 catgagcagt ccttgcaagc tctaaatcct agctgccttg cccttttgaa ccatcttcgc 

1021 atttacattc ccaagctttt tccagatctc aacaagatag tgttgttgga tgatgatgta
1081 gtagtacaga gcgatctttc gtctttatgg gaaacggatc tcaacggtaa agttgttggt 
1141 gctgtcgttg attcgtggtg cggagacaac tgttgccccg gaagaaaata caaagactat 
1201 ttcaacttct cacatccttt gatctcatca aacttagttc aagaagactg tgcttggctt 
1261 tctggtatga atgtctttga tctcaaagcc tggagacaaa ccaatattac tgaagcttac 

1321 tctacatggc taagactcag tgttaggtca ggactacaat tatggcaacc aggggcttta
1381 ccaccgacat tacttgcttt caaaggactt acacagtctc ttgaaccatc atggcacgtc 
1441 gctggactag gttctcgatc cgtaaaatcc cctcaagaga ttctgaaatc tgcttcggtt 
1501 ttacatttca gcggtccagc aaaaccgtgg ctagagatca gtaaccctga ggtacgatct 
1561 ctttggtata gatacgtaaa ttcctccgac atcttcgtta gaaaatgcaa aatcatgaac 

1621 tga

Amino Acid Sequence of Sequence #12: (SEQ ID NO: 24)
Genebank ID# NP_191438.2
Positions 1-540 of NP_191438.

1 mkfyisatgi kkvtisnpgv gigkgsggca aaaaalaarr fssrtlllll lllaivlpfi

61 fvrfaflvle sasvcdspld cmglrlfrgg dtslkigeel tralveettd hqdvngrgtk 
121 gslesfddlv kemtlkrrdi rafasvtkkm llqmerkvqs akhhelvywh lashgipksl 
181 hclslrltee ysvnamarmr lpppesvsrl tdpsfhhivl ltdnvlaasv visstvqnav
241 npekfvfhiv tdkktytpmh awfainsass pvvevkglhq ydwpqevnfk vremldihrl
301 iwrrhyqnlk dsdfsfvegt heqslqalnp sclallnhlr iyipklfpdl nkivlldddv

361 vvqsdlsslw etdlngkvvg avvdswcgdn ccpgrkykdy fnfshpliss nlvqedcawl
421 sgmnvfdlka wrqtniteay stwlrlsvrs glqlwqpgal pptllafkgl tqslepswhv 
481 aglgsrsvks pqeilksasv lhfsgpakpw leisnpevrs lwyryvnssd ifvrkckimn

Sequence #13 (SEQ ID NO: 25)

Gene name: At4g38270 

GeneBank accession # for reference: NM_119989 GI:30691874
Nucleotide sequence of Sequence #13:
Positions 1-2043 of CDS of NM_119989.

1 atgacgacgt tctctacatg cgccgccttt ttatcgctgg tagtagtgct acatgctgtt 
61 catgtcggtg gagccatttt agagtcacaa gcaccccaca gagaacttaa agcttatcgt 
121 ccgctgcaag ataataatct acaggaggtg tatgcttcct cagctgctgc agtgcactac 
181 gatccagatc tgaaagatgt gaacatagtt gcgacataca gtgaccatta cggcaatata
241 cgccttggta gggtgaaaat gggggatctt tcaccttctt gggttttgga gaatcctgcc 
301 tatcaagtta gccgcaaaac aaaaggttcg cagctagtta taccacggga ttcatttcaa 
361 aatgatactg gaatggaaga taatgcaagc cattctacaa ctaatcagac tgatgaaagc 
421 gaaaatcagt ttccaaacgt ggattttgca agcccagcaa aactgaagcg gcagatttta 
481 cgtcaggaaa ggagaggtca acgaacttta gagctgatcc gacaagaaaa ggaaactgat
541 gagcagatgc aagaagcagc cattcagaag tcaatgagct ttgaaaactc agtcataggg 
601 aaatacagta tatggaggag agactatgag agcccaaatg ctgatgctat cttgaagctt 
661 atgagagacc agatcataat ggcaaaagca tatgcaaata ttgccaaatc aaaaaatgta 
721 accaatctgt acgttttctt gatgcagcag tgtggagaaa ataaacgtgt tataggtaaa 
781 gcaacctctg atgctgacct tccttcaagc gctcttgatc aagcaaaagc catgggccat
841 gcactctctc ttgcaaaaga cgagttatat gactgccatg aacttgcaaa aaagttccgg 
901 gccatccttc agtccactga acgcaaagta gatggactga agaaaaaggg aaccttctta 
961 attcagctag ctgccaaaac atttcccaag ccattgcatt gcctgagtct gcagctagcg 
1021 gcagactatt ttattctagg tttcaatgaa gaggatgcag tgaaagagga tgtcagtcaa 
1081 aagaagcttg aagatccttc gctctatcac tatgcgatct tttcggataa cgttctggct

1141 acatcagtgg tggtgaactc cactgtcttg aatgcaaagg aaccgcagag gcatgtgttc 
1201 catatagtaa ctgacaaact gaattttggt gcaatgaaga tgtggtttcg catcaatgct 
1261 cctgctgatg cgacgattca agttgaaaac ataaatgatt tcaagtggct gaactcctct 
1321 tactgctctg ttctacggca gcttgaatct gcaaggctga aagaatacta tttcaaagca 
1381 aatcatcctt catcaatctc agctggcgca gataatctaa agtaccgcaa cccaaagtat
1441 ctatcgatgc tgaatcatct cagattctac cttcctgagg tttatccgaa gctggagaag 
1501 attctgtttc tagacgatga cattgtggtg cagaaggacc tggcaccact atgggaaata 
1561 gacatgcaag gaaaagtgaa tggtgcggtg gagacgtgca aggagagctt ccacagattt 
1621 gacaagtacc tcaacttctc aaatccaaag atttcagaga attttgacgc tggtgcttgt 
1681 gggtgggcat ttgggatgaa tatgtttgac ctgaaagagt ggaggaaacg gaacattaca
1741 gggatatatc actattggca agacttgaat gaagacagaa cactgtggaa gctgggatcg
1801 ttgccaccgg ggctgataac attttacaac ctgacgtatg caatggatag gagctggcac 
1861 gtactagggc tgggatatga cccagcgcta aaccaaacag caatagagaa tgcagcggta 
1921 gtgcattaca atgggaacta caagccatgg ctgggtttag cattcgccaa gtacaaaccg 
1981 tactggtcca agtacgttga gtacgacaac ccttatctcc gacggtgcga catcaatgaa
2041 tga 



Amino Acid Sequence of Sequence #13: (SEQ ID NO: 26) 
Genebank ID# NP_195540.2
Positions 1-680 of NP_195540.

1 mttfstcaaf lslvvvlhav hvggailesq aphrelkayr plqdnnlqev yassaaavhy
61 dpdlkdvniv atysdhygni rlgrvkmgdl spswvlenpa yqvsrktkgs qlviprdsfq 

121 ndtgmednas hsttnqtdes enqfpnvdfa spaklkrqil rqerrgqrtl elirqeketd

181 eqmqeaaiqk smsfensvig kysiwrrdye spnadailkl mrdqiimaka yaniaksknv 
241 tnlyvflmqq cgenkrvigk atsdadlpss aldqakamgh alslakdely dchelakkfr 
301 ailqsterkv dglkkkgtfl iqlaaktfpk plhclslqla adyfilgfne edavkedvsq 
361 kkledpslyh yaifsdnvla tsvvvnstvl nakepqrhvf hivtdklnfg amkmwfrina

421 padatiqven indfkwlnss ycsvlrqles arlkeyyfka nhpssisaga dnlkyrnpky
481 lsmlnhlrfy lpevypklek ilfldddivv qkdlaplwei dmqgkvngav etckesfhrf
541 dkylnfsnpk isenfdagac gwafgmnmfd lkewrkrnit giyhywqdln edrtlwklgs
601 lppglitfyn ltyamdrswh vlglgydpal nqtaienaav vhyngnykpw lglafakykp
661 ywskyveydn pylrrcdine 


Sequence #14 (SEQ ID NO: 27) 
Gene name: At5g15470 

GeneBank accession # for reference: NM_121551 GI:30685368
Nucleotide sequence of Sequence #14:
Positions 1-1599 of CDS of NM_121551. 

1 atgcagcttc acatatcgcc gagtatgaga agcattacga tttcgagcag caatgagttt 
61 attgacttga tgaagatcaa ggtcgcagct cgtcacatct cttaccgaac tctcttccac 

121 accatcttaa tcctcgcttt cttgttgcct tttgttttca ttctcaccgc tgttgttacc

181 cttgagggtg tcaacaaatg ctcctccatt gattgtttag ggaggcggat aggtccacgt 
241 cttcttggta gggtagatga ttcagagaga ctagctagag acttttataa aattctaaac 
301 gaagtaagca ctcaagaaat tccagatggt ttgaagcttc caaattcttt tagtcaactt 
361 gtttccgata tgaagaataa ccactatgat gcaaaaacat ttgctcttgt gctgcgagcc 

421 atgatggaga agtttgaacg tgatatgagg gaatcgaaat ttgcagaact tatgaacaag
481 cactttgcag caagttccat tcccaaaggc attcattgtc tctctctaag actgacagat 
541 gaatattcct ccaatgctca tgctcgaaga cagcttcctt caccagagtt tctccctgtt 
601 ctttcagata atgcttacca ccactttatt ttgtccacgg acaatatttt ggctgcctca 
661 gttgtggtct catccgctgt tcagtcatct tcaaaacccg agaaaattgt ctttcacatc 

721 attacagaca agaaaaccta tgcgggtatg cattcatggt ttgcgcttaa ttctgttgca
781 ccagcaattg ttgaggttaa aggtgttcat cagtttgact ggttgacgag agagaatgtt 
841 ccggttttgg aagctgtgga aagccataat ggtgtcaggg actattatca tgggaatcat 
901 gtcgctgggg caaacctcac cgaaacaact cctcgaacat ttgcttcaaa attgcagtct 
961 agaagtccaa aatacatatc tttgctcaac catcttagaa tatatatacc agagcttttc 

1021 ccgaacttgg acaaggtggt tttcttagac gatgatatag ttgtccaggg agacttaact
1081 ccactttggg atgttgacct cggtggtaag gtcaatgggg cagtagagac ttgcaggggt
1141 gaagatgaat gggtgatgtc aaagcgttta aggaactact tcaatttctc tcacccgctc
1201 atcgcaaagc atttagatcc tgaagaatgt gcttgggcat atggtatgaa tatcttcgat 
1261 ctacaagctt ggaggaaaac aaatatcaga gaaacgtatc actcttggct tagagagaat 
1321 ctaaagtcaa atctgacaat gtggaaactt ggaaccttgc ctcctgctct tatcgcgttc

1381 aagggtcacg tacacataat agactcgtca tggcatatgc taggattagg ctaccagagc 
1441 aagaccaaca tagaaaatgt gaagaaagca gcagtgatcc actacaatgg gcagtcaaag 
1501 ccatggctgg agattggttt cgagcatctg cggccattct ggaccaaata cgtcaactac
1561 tcaaatgatt tcatcaagaa ctgtcacata ttggagtag

Amino Acid Sequence of Sequence #14: (SEQ ID NO: 28) 
Genebank ID# NP_197051
Positions 1-532 of NP_197051.


1 mqlhispsmr sitisssnef idlmkikvaa rhisyrtlfh tililafllp fvfiltavvt 
61 legvnkcssi dclgrrigpr llgrvddser lardfykiln evstqeipdg lklpnsfsql
121 vsdmknnhyd aktfalvlra mmekferdmr eskfaelmnk hfaassipkg ihclslrltd 
181 eyssnaharr qlpspeflpv lsdnayhhfi lstdnilaas vvvssavqss skpekivfhi
241 itdkktyagm hswfalnsva paivevkgvh qfdwltrenv pvleaveshn gvrdyyhgnh
301 vaganltett prtfasklqs rspkyislln hlriyipelf pnldkvvfld ddivvqgdlt
361 plwdvdlggk vngavetcrg edewvmskrl rnyfnfshpl iakhldpeec awaygmnifd 
421 lqawrktnir etyhswlren lksnltmwkl gtlppaliaf kghvhiidss whmlglgyqs
481 ktnienvkka avihyngqsk pwleigfehl rpfwtkyvny sndfiknchi le 


Sequence #15 (SEQ ID NO: 29) 
Gene name: At5g54690 

GeneBank accession # for reference: NM_124850 GI:30696504
Nucleotide sequence of Sequence #15:
Positions 1-1608 of CDS of NM_124850.

1 atgcagttac atatatctcc gagcttgaga catgtgactg tggtcacagg gaaaggattg 
61 agagagttca taaaagttaa ggttggttct agaagattct cttatcaaat ggtgttttac 

121 tctctactct tcttcacttt tcttctccga ttcgtctttg ttctctccac cgttgatact

181 atcgacggcg atccctctcc ttgctcctct cttgcttgct tggggaaaag actaaagcca 
241 aagcttttag gaagaagggt tgattctggt aatgttccag aagctatgta ccaagtttta 
301 gaacagcctt taagcgaaca agaactcaaa ggaagatcag atatacctca aacacttcaa 
361 gatttcatgt ctgaagtcaa aagaagcaaa tcagacgcaa gagaatttgc tcaaaagcta 

421 aaagaaatgg tgacattgat ggaacagaga acaagaacgg ctaagattca agagtattta
481 tatcgacatg tcgcatcaag cagcataccg aaacaacttc actgtttagc tcttaaacta 
541 gccaacgaac actcgataaa cgcagcggcg cgtctccagc ttccagaagc tgagcttgtc 
601 cctatgttgg tagacaacaa ctactttcac tttgtcttgg cttcagacaa tattcttgca 
661 gcttcggttg tggctaagtc gttggttcaa aatgctttaa gacctcataa gatcgttctt 

721 cacatcataa cggataggaa aacttatttc ccaatgcaag cttggttctc attgcatcct
781 ctgtctccag caataattga ggtcaaggct ttgcatcatt tcgattggtt atcgaaaggt 
841 aaagtacccg ttttggaagc tatggagaaa gatcagagag tgaggtctca attcagaggt 
901 ggatcatcgg ttattgtggc taataacaaa gagaacccgg ttgttgttgc tgctaagtta 
961 caagctctca gccctaaata caactccttg atgaatcaca tccgtattca tctaccagag 

1021 ttgtttccaa gcttaaacaa ggttgtgttt ctagacgatg acattgtgat ccaaactgat
1081 ctttcacctc tttgggacat tgacatgaat ggaaaagtaa atggagcagt ggaaacatgt
1141 agaggagaag acaagtttgt gatgtcaaag aagttcaaga gttacctcaa cttctcgaat 
1201 ccgacaattg ccaaaaactt caatccagag gaatgtgcat gggcttatgg aatgaatgtt 
1261 ttcgacctag cggcttggag gaggactaac ataagctcca cttactatca ttggcttgac 
1321 gagaacttaa aatcagacct gagtttgtgg cagctgggaa ctttgcctcc tgggctgatt
1381 gctttccacg gtcatgtcca aaccatagat ccgttctggc atatgcttgg tctcggatac 
1441 caagagacca cgagctatgc cgatgctgaa agtgccgctg ttgttcattt caatggaaga 
1501 gctaagcctt ggctggatat agcatttcct catctacgtc ctctctgggc taagtatctt 
1561 gattcttctg acagatttat caagagctgt cacattagag catcatga 


Amino Acid Sequence of Sequence #15: (SEQ ID NO: 30) 
Genebank ID# NP_200280 
Positions 1-535 of NP_200280.

1 mqlhispslr hvtvvtgkgl refikvkvgs rrfsyqmvfy sllfftfllr fvfvlstvdt

61 idgdpspcss laclgkrlkp kllgrrvdsg nvpeamyqvl eqplseqelk grsdipqtlq 
121 dfmsevkrsk sdarefaqkl kemvtlmeqr trtakiqeyl yrhvasssip kqlhclalkl 
181 anehsinaaa rlqlpeaelv pmlvdnnyfh fvlasdnila asvvakslvq nalrphkivl
241 hiitdrktyf pmqawfslhp lspaiievka lhhfdwlskg kvpvleamek dqrvrsqfrg

301 gssvivannk enpvvvaakl qalspkynsl mnhirihlpe lfpslnkvvf ldddiviqtd

361 lsplwdidmn gkvngavetc rgedkfvmsk kfksylnfsn ptiaknfnpe ecawaygmnv
421 fdlaawrrtn isstyyhwld enlksdlslw qlgtlppgli afhghvqtid pfwhmlglgy 
481 qettsyadae saavvhfngr akpwldiafp hlrplwakyl dssdrfiksc hiras


The nucleotide and amino acid sequences of the ten GALAT-LIKE gene 
family members are shown as follows. 



Sequence #16 (SEQ ID NO:31) 

Gene name: At1g02720

GeneBank accession # for reference: NM_100152, GI:30678358
Nucleotide sequence of Sequence #16: 
Positions 1-1086 of CDS of NM_100152. 


1 atgcattgga ttacgagatt ctctgctttc ttctccgccg cattagccat gattctcctt 
61 tctccttcgc tccaatcctt ttctccggcg gcagctatcc gatcatctca cccctacgcc 
121 gacgaattca aaccccaaca aaactccgat tactcctcct tcagagaatc tccaatgttc 
181 cgtaacgccg aacaatgcag atcttccggc gaagattccg gcgtctgtaa ccctaatctc 

241 gtccacgtag ccatcactct cgacatcgat tacctccgtg gctcaatcgc agccgtcaat
301 tcgatcctcc agcactcaat gtgccctcaa agcgtcttct tccacttcct cgtctcctcc 
361 gagtctcaaa acctagaatc tctgattcgt tctactttcc ccaaattgac gaatctcaaa 
421 atttactatt ttgcccctga gaccgtacag tctttgattt catcttccgt gagacaagcc 
481 ctagagcaac cgttgaatta cgccagaaat tacttggcgg atctgctcga gccttgcgtt 

541 aagcgagtca tctacttgga ttcggatctc gtcgtcgtcg atgatatcgt caagctttgg

601 aaaacgggtt taggccagag aacaatcgga gctccggagt attgtcacgc gaatttcacg 
661 aaatacttca ccggaggttt ttggtcagat aagaggttta acgggacgtt caaagggagg 
721 aacccttgtt acttcaatac tggtgtaatg gtgattgatt tgaagaagtg gagacaattt
781 aggttcacga aacgaattga gaaatggatg gagattcaga agatagagag gatttatgag
841 cttggttctc ttcctccgtt tcttctggta tttgctggtc atgtagctcc gatttcacat 
901 cggtggaatc aacatgggct tggtggtgat aatgttagag gtagttgccg tgatttgcat 
961 tctggtcctg tgagtttgct tcactggtca ggtagtggta agccatggtt aagactcgat 
1021 tccaagcttc catgtccttt agacacattg tgggcacctt atgatttgta taaacactcc
1081 cattga 



Amino Acid Sequence of Sequence #16: (SEQ ID NO: 32)
Genebank ID# NP_171772 
Positions 1-361. 



1 mhwitrfsaf fsaalamill spslqsfspa aairsshpya defkpqqnsd yssfrespmf

61 rnaeqcrssg edsgvcnpnl vhvaitldid ylrgsiaavn silqhsmcpq svffhflvss 
121 esqnleslir stfpkltnlk iyyfapetvq slisssvrqa leqplnyarn yladllepcv 
181 krviyldsdl vvvddivklw ktglgqrtig apeychanft kyftggfwsd krfngtfkgr
241 npcyfntgvm vidlkkwrqf rftkriekwm eiqkieriye lgslppfllv faghvapish

301 rwnqhglggd nvrgscrdlh sgpvsllhws gsgkpwlrld sklpcpldtl wapydlykhs
361 h 



Sequence #17 (SEQ ID NO:33) 

Gene name: At1g13250 
GeneBank accession # for reference: NM_101196, GI:30683194
Nucleotide sequence of Sequence #17:
Positions 1-1038 of CDS of NM_101196.



1 atgtcttctc tgcgtttgcg tttatgtctt cttctactct tacctatcac aattagctgc

61 gtcacagtca ctctcactga cctccccgcg tttcgtgaag ctccggcgtt tcgaaacggc 
121 agagaatgct ccaaaacgac atggatacct tcggatcacg aacacaaccc atcaatcatc 
181 cacatcgcta tgactctcga cgcaatttac ctccgtggct cagtcgccgg cgtcttctcc 
241 gttctccaac acgcttcttg tcctgaaaac atcgttttcc acttcatcgc cactcaccgt 

301 cgcagcgccg atctccgccg cataatctcc tcaacattcc catacctaac ctaccacatt
361 taccattttg accctaacct cgtccgcagc aaaatatctt cctctattcg tcgtgcttta 
421 gaccaaccgt taaactacgc tcggatctac ctcgccgatc tcctcccaat cgccgtccgc 
481 cgcgtaatct acttcgactc cgatctcgta gtcgtcgatg acgtggctaa actctggaga 
541 atcgatctac gtcggcacgt cgtcggagct ccggagtact gtcacgcgaa tttcactaac 

601 tacttcactt caagattctg gtcgagtcaa ggttacaaat cggcgttgaa agataggaaa
661 ccgtgttatt tcaacaccgg agtgatggtg attgatctcg gaaaatggag agaaaggaga 
721 gtcacggtga agctagagac atggatgagg attcaaaaac gacatcgtat ttacgaattg 
781 ggatctttgc ctccgtttct gctcgttttc gccggagatg ttgagccggt ggagcatagg 
841 tggaatcagc atggtcttgg tggtgataac ttggaaggac tttgccggaa tttgcatcca 

901 ggtccggtga gtttgttgca ttggagcggg aaagggaaac catggctaag gcttgactcg
961 agacgaccgt gtccgttgga ttcgttatgg gctccttatg atttgtttcg ttattcaccg 
1021 ttgatctctg atagctga

Amino Acid Sequence of Sequence #17: (SEQ ID NO: 34)
Genebank ID# NP_563925 
Positions 1-345. 

1 msslrlrlcl llllpitisc vtvtltdlpa freapafrng recskttwip sdhehnpsii

61 hiamtldaiy lrgsvagvfs vlqhascpen ivfhfiathr rsadlrriis stfpyltyhi
121 yhfdpnlvrs kisssirral dqplnyariy ladllpiavr rviyfdsdlv vvddvaklwr
181 idlrrhvvga peychanftn yftsrfwssq gyksalkdrk pcyfntgvmv idlgkwrerr 
241 vtvkletwmr iqkrhriyel gslppfllvf agdvepvehr wnqhglggdn leglcrnlhp 

301 gpvsllhwsg kgkpwlrlds rrpcpldslw apydlfrysp lisds



Sequence #18 (SEQ ID NO:35) 
Gene name: At1g19300

GeneBank accession # for reference: NM_101787, GI:30686302
Nucleotide sequence of Sequence #18:
Positions 1-1056 of CDS of NM_101787.

1 atgtcccaac atcttcttct tctcattctc ctctcgctac ttcttcttca taaacccatt 
61 tccgccacta caattattca aaaattcaaa gaagccccac agttttacaa ttctgcagat 

121 tgccccttaa tcgatgactc cgagtccgac gatgacgtgg tcgccaaacc aatcttctgc
181 tcacgtcgag ctgtccacgt ggcgatgaca ctcgacgccg cctacattcg tggctcagtc 
241 gccgctgttc tctccgtcct ccaacactct tcttgtcctg aaaacattgt tttccacttc 
301 gtcgcctctg cttccgccga cgcttcttcc ttacgagcca ccatatcctc ctctttccct 
361 taccttgatt tcaccgtcta cgtcttcaac gtctcctccg tctctcgcct tatctcctcc 

421 tctatccgct ccgcactaga ctgtccttta aactacgcaa gaagctacct cgccgatctc
481 ctccctccct gcgtccgccg cgtcgtctac ctagactccg atctgatcct cgtcgacgac 
541 atagcaaaac tcgccgccac agatctcggc cgtgattcag tcctcgccgc gccggaatac 
601 tgcaacgcca atttcacttc atacttcaca tcaaccttct ggtctaatcc gactctctct 
661 ttaaccttcg ccgatcggaa agcatgctac ttcaacactg gagtcatggt gatcgatctt 

721 tcccggtggc gcgaaggcgc gtacacgtca cgcatcgaag agtggatggc gatgcaaaag
781 agaatgagaa tttacgagct tggttcgtta ccaccgtttt tattggtttt tgccggtttg 
841 attaaaccgg ttaatcatcg gtggaaccaa cacggtttag gaggtgataa tttcagagga 
901 ctgtgtagag atctccatcc tggtccggtg agtctgttgc attggagtgg gaaaggtaag 
961 ccatgggcta ggcttgatgc tggtcggcct tgtcctttag acgcgctttg ggctccgtat 

1021 gatcttcttc aaacgccgtt cgcgttggat tcttga

Amino Acid Sequence of Sequence #18: (SEQ ID NO: 36) 
Genebank ID# NP_564077 
Positions 1-351. 


1 msqhllllil lsllllhkpi sattiiqkfk eapqfynsad cpliddsesd ddvvakpifc
61 srravhvamt ldaayirgsv aavlsvlqhs scpenivfhf vasasadass lratisssfp
121 yldftvyvfn vssvsrliss sirsaldcpl nyarsyladl lppcvrrvvy ldsdlilvdd
181 iaklaatdlg rdsvlaapey cnanftsyft stfwsnptls ltfadrkacy fntgvmvidl
241 srwregayts rieewmamqk rmriyelgsl ppfllvfagl ikpvnhrwnq hglggdnfrg
301 lcrdlhpgpv sllhwsgkgk pwarldagrp cpldalwapy dllqtpfald s

Sequence #19 (SEQ ID NO:37)
Gene name: At1g24170 

GeneBank accession # for reference: NM_102263, GI:30688765
Nucleotide sequence of Sequence #19:
Positions 1-1182 of CDS of NM_102263.

1 atgtcgtcgc gtttttcttt gacggtggtg tgtttgattg ctctgttacc gtttgttgtt 
61 ggtatacggt tgattccggc gaggatcacg agtgtcggtg atggcggcgg cggaggaggt 

121 aataatgggt ttagtaaact tggtccgttt atggaagctc cggagtatag aaacggcaag
181 gagtgtgtat cttcatcagt gaacagagag aacttcgtgt cgtcttcttc tagttctaat 
241 gatccttcgc ttgttcacat cgctatgact ttggactcag agtatctccg tggatcaatc 
301 gcagccgttc attctgttct tcgccacgcg tcttgtccag agaacgtctt cttccatttc 
361 atcgctgctg agtttgactc tgcgagtcct cgtgttctga gtcaactcgt gaggtcgact 

421 tttccttcgt tgaactttaa agtctacatt tttagggaag atacggtgat caatctcata
481 tcttcttcga ttagactagc tttggagaat ccgttgaact atgctcggaa ctatctcgga 
541 gatattcttg atcgaagtgt tgaacgagtc atttatcttg actcggatgt tataactgtg 
601 gatgatatca caaagctttg gaacacggtt ttgaccgggt cacgagtcat cggagctccg 
661 gagtattgtc acgcgaactt cactcagtat ttcacttccg ggttctggtc agacccggct 

721 ttaccgggtc taatctcggg tcaaaagcct tgctatttca acacaggagt gatggtgatg

781 gatcttgtta gatggagaga agggaattac agagagaagt tagagcaatg gatgcaattg 
841 cagaagaaga tgagaatcta cgatcttgga tcattaccac cgtttctttt ggtgtttgcg 
901 ggtaatgttg aagctattga tcatagatgg aaccaacatg gtttaggagg agacaatata 
961 cgaggaagtt gtcggtcatt gcatcctggt cctgtgagct tgttgcattg gagtggtaaa 

1021 ggtaagccat gggttagact tgatgagaag aggccttgtc cgttggatca tctttgggag
1081 ccatatgatt tgtataagca taagattgag agagctaaag atcagtctct gcttgggttt 
1141 gcttctctgt cggagttgac tgatgattca agcttcttgt ga 



Amino Acid Sequence of Sequence #19: (SEQ ID NO: 38)
Genebank ID# NP_173827 
Positions 1-393. 

1 mssrfsltvv cliallpfvv girliparit svgdgggggg nngfsklgpf meapeyrngk
61 ecvsssvnre nfvssssssn dpslvhiamt ldseylrgsi aavhsvlrha scpenvffhf

121 iaaefdsasp rvlsqlvrst fpslnfkvyi fredtvinli sssirlalen plnyarnylg 
181 dildrsverv iyldsdvitv dditklwntv ltgsrvigap eychanftqy ftsgfwsdpa
241 lpglisgqkp cyfntgvmvm dlvrwregny rekleqwmql qkkmriydlg slppfllvfa
301 gnveaidhrw nqhglggdni rgscrslhpg pvsllhwsgk gkpwvrldek rpcpldhlwe 
361 pydlykhkie rakdqsllgf aslseltdds sfl

Sequence #20 (SEQ ID NO:39)
Gene name: At1g70090 

GeneBank accession # for reference: NM_105677, GI:30697975
Nucleotide sequence of Sequence #20:
Positions 1-1173 of CDS of NM_105677.

1 atgcggttgc gttttccgat gaaatctgcc gttttagcgt ttgctatctt tctggtgttt 
61 attccactgt tttccgtcgg tatacggatg attccgggaa gactcaccgc cgtatccgcc 
121 accgtcggaa atggctttga tctggggtcg ttcgtggaag ctccggagta cagaaacggc 
181 aaggagtgcg tgtctcaatc gttgaacaga gaaaacttcg tgtcgtcttg cgacgcttcg 
241 ttagttcatg tagctatgac gcttgactcg gagtacttac gtggctcaat cgcagccgta 
301 cattcaatgc tccgccacgc gtcgtgtcca gaaaacgtct tcttccatct catcgctgca 
361 gagtttgacc cggcgagtcc acgcgttctg agtcaactcg tccgatctac tttcccgtcg 
421 ctaaacttca aagtctacat tttccgggaa gatacggtga tcaaccttat ctcttcttca 
481 atcagacaag ctttagagaa tccattgaac tatgctcgga actacctcgg agatattctt 
541 gatccatgcg tagacagagt catttaccta gactcggaca tcatcgtcgt cgatgacata 
601 acaaagcttt ggaacacgag tttgacaggg tcaagaatca tcggagctcc ggagtattgt 
661 cacgctaact tcacaaagta cttcacttca ggtttctggt ccgacccggc tttacccggt
721 ttcttctcgg gtcgaaagcc ttgttatttc aacacgggtg tgatggtgat ggatctagtt 
781 agatggagag aaggaaacta cagagaaaag cttgaaactt ggatgcagat acagaagaag 
841 aagagaatct acgatttggg ttctttgcct ccgtttcttc ttgtcttcgc agggaacgtt 
901 gaagcaattg atcataggtg gaaccaacat ggtttaggag gagacaatgt acgaggaagt 
961 tgtaggtctt tgcataaagg accagtgagt ttgttgcatt ggagtggtaa aggtaagcca 
1021 tgggtgagac ttgatgagaa gagaccgtgt ccgttggatc atttatggga accgtatgat 
1081 ttatatgagc ataagattga aagagctaaa gatcagtctt tgttcgggtt ctcttctttg 
1141 tctgagttaa cagaagattc aagctttttc tga 



Amino Acid Sequence of Sequence #20: (SEQ ID NO: 40) 
Genebank ID# NP_564983 
Positions 1-390. 



1 mrlrfpmksa vlafaiflvf iplfsvgirm ipgrltavsa tvgngfdlgs fveapeyrng 
61 kecvsqslnr enfvsscdas lvhvamtlds eylrgsiaav hsmlrhascp envffhliaa
121 efdpasprvl sqlvrstfps lnfkvyifre dtvinlisss irqalenpln yarnylgdil
181 dpcvdrviyl dsdiivvddi tklwntsltg sriigapeyc hanftkyfts gfwsdpalpg 
241 ffsgrkpcyf ntgvmvmdlv rwregnyrek letwmqiqkk kriydlgslp pfllvfagnv 
301 eaidhrwnqh glggdnvrgs crslhkgpvs llhwsgkgkp wvrldekrpc pldhlwepyd 
361 lyehkierak dqslfgfssl seltedssff

Sequence #21 (SEQ ID NO:41)
Gene name: At3g06260 

GeneBank accession # for reference: NM_111501, GI:18397517
Nucleotide sequence of Sequence #21:
Positions 1-1056 of CDS of NM_111501.

1 atggcctcaa ggagcctctc ctatacacaa ctcctaggcc tcctgtcctt tatactcctc 
61 ttggtcacaa ccaccactat ggcggttcgt gttggagtca ttcttcataa gccttctgct 

121 ccaactcttc ctgttttcag agaagccccg gcttttcgaa acggtgatca atgcgggact
181 cgtgaggctg atcagattca tatcgccatg actctcgaca caaactacct ccgtggcaca 
241 atggctgccg ttttgtctct ccttcaacat tccacttgcc ctgaaaacct ctcttttcat 
301 ttcctgtccc ttcctcattt cgaaaacgac cttttcacca gcatcaaatc aacctttcct 
361 tacctaaact tcaagattta tcagtttgat ccaaacctcg tccgcagcaa gatatcgaaa 

421 tccatcaggc aagcccttga tcagcctctt aactacgcaa gaatctacct cgcggatatc
481 atccctagca gcgttgacag gatcatctac ttagactcag acctcgttgt ggtagacgac 
541 atagagaagc tgtggcatgt ggagatggaa ggtaaagtgg tggctgctcc cgagtactgc 
601 cacgcaaact tcacccatta tttcacaaga actttctggt cagacccggt attggtcaaa 
661 gttcttgaag gaaaacgtcc gtgttatttc aacacagggg tgatggttgt ggatgtaaac 

721 aaatggagga aaggaatgta tacacagaag gtagaagagt ggatgacaat tcagaagcag
781 aagaggatat accatttggg atcattacct ccgtttctgc tgatattcgc cggtgatata 
841 aaagcggtta atcataggtg gaaccagcat ggtctaggag gtgataattt cgaaggaaga 
901 tgtagaacgt tgcatccggg accgataagt cttcttcact ggagtggaaa agggaagcca 
961 tggttaagac tagattcaag gaagccttgt atcgttgatc atctatgggc accgtatgat 

1021 ctgtaccgtt catcaagaca ttcattagaa gagtag



Amino Acid Sequence of Sequence #21: (SEQ ID NO: 42)
Genebank ID# NP_187277
Positions 1-351. 

1 masrslsytq llgllsfill lvttttmavr vgvilhkpsa ptlpvfreap afrngdqcgt
61 readqihiam tldtnylrgt maavlsllqh stcpenlsfh flslphfend lftsikstfp
121 ylnfkiyqfd pnlvrskisk sirqaldqpl nyariyladi ipssvdriiy ldsdlvvvdd

181 ieklwhveme gkvvaapeyc hanfthyftr tfwsdpvlvk vlegkrpcyf ntgvmvvdvn 
241 kwrkgmytqk veewmtiqkq kriyhlgslp pfllifagdi kavnhrwnqh glggdnfegr 
301 crtlhpgpis llhwsgkgkp wlrldsrkpc ivdhlwapyd lyrssrhsle e

Sequence #22 (SEQ ID NO:43)
Gene name: At3g28340 

GeneBank accession # for reference: NM_113753, GI:30689155
Nucleotide sequence of Sequence #22:
Positions 1-1098 of CDS of NM_113753.

1 atgatgtctg gttcaagatt agcctctaga ctaataataa tcttctcaat aatctccaca 
61 tctttcttca ccgttgaatc gattcgacta ttccctgatt cattcgacga tgcatcttca 

121 gatttaatgg aagctccagc atatcaaaac ggtcttgatt gctctgtttt agccaaaaac
181 agactcttgt tagcttgtga tccatcagct gttcatatag ctatgactct agatccagct 
241 tacttgcgtg gcacggtatc tgcagtacat tccatcctca aacacacttc ttgccctgaa 
301 aacatcttct tccacttcat tgcttcgggt acaagtcagg gttccctcgc caagacccta 
361 tcctctgttt ttccttcttt gagtttcaaa gtctatacct ttgaagaaac cacggtcaag 

421 aatctaatct cttcttctat aagacaagct cttgatagtc ctttgaatta cgcaagaagc
481 tacttatccg agattctttc ttcgtgtgtt agtcgagtga tttatctcga ttcggatgtg 
541 attgtggtcg atgatattca gaaactatgg aagatttctt tatccgggtc aagaacaatc 
601 ggtgcaccag agtattgcca cgcaaatttc accaaatact tcacagatag tttctggtcc 
661 gatcaaaaac tctcgagtgt cttcgattcc aagactcctt gttatttcaa cacaggagtg 

721 atggttatcg atttagagcg atggagagaa ggagattaca cgagaaagat cgaaaactgg
781 atgaagattc agaaagaaga taagagaatc tacgaattgg gttctttacc accgtttctt 
841 ctagtgtttg gtggtgatat tgaagctatt gatcatcaat ggaaccaaca cggtctcggt 
901 ggagacaaca ttgtgagtag ttgtagatct ttgcatcctg gtccggttag tttgatacat 
961 tggagtggta aagggaagcc atgggttagg cttgatgatg gtaagccttg tccaattgat 

1021 tatctttggg ctccttatga tcttcacaag tcacagaggc agtatcttca atacaatcaa
1081 gagttagaaa ttctttga 



Amino Acid Sequence of Sequence #22: (SEQ ID NO: 44)
Genebank ID# NP_189474
Positions 1-365. 



1 mmsgsrlasr liiifsiist sfftvesirl fpdsfddass dlmeapayqn gldcsvlakn

61 rlllacdpsa vhiamtldpa ylrgtvsavh silkhtscpe niffhfiasg tsqgslaktl 
121 ssvfpslsfk vytfeettvk nlisssirqa ldsplnyars ylseilsscv srviyldsdv
181 ivvddiqklw kislsgsrti gapeychanf tkyftdsfws dqklssvfds ktpcyfntgv 
241 mvidlerwre gdytrkienw mkiqkedkri yelgslppfl lvfggdieai dhqwnqhglg

301 gdnivsscrs lhpgpvslih wsgkgkpwvr lddgkpcpid ylwapydlhk sqrqylqynq
361 eleil 

Sequence #23 (SEQ ID NO:45) 

Gene name: At3g50760

GeneBank accession # for reference: NM_114936, GI:18409176
Nucleotide sequence of Sequence #23:
Positions 1-1026 of CDS of NM_114936.

1 atgcactcga agtttatatt atatctcagc atcctcgccg tattcaccgt ctctttcgcc
61 ggcggcgaga gattcaaaga agctccaaag ttcttcaact ccccggagtg tctaaccatc
121 gaaaacgatg aagatttcgt ttgttcagac aaagccatcc acgtggcaat gaccttagac 
181 acagcttacc tccgtggctc aatggccgtg attctctccg tcctccaaca ctcttcttgt 
241 cctcaaaaca ttgttttcca cttcgtcact tcaaaacaaa gccaccgact ccaaaactac 
301 gtcgttgctt cttttcccta cttgaaattc cgaatttacc cttacgacgt agccgccatc

361 tccggcctca tctcaacctc catccgctcc gcgctagact ctccgctaaa ctacgcaaga 
421 aactacctcg ccgacattct tcccacgtgc ctctcacgtg tcgtatacct agactcagat 
481 ctcatactcg tcgatgacat ctccaagctc ttctccactc acatccctac cgacgtcgtt 
541 ttagccgcgc ctgagtactg caacgcaaac ttcacgactt actttactcc gacgttttgg 

601 tcaaaccctt ctctctccat cacactatcc ctcaaccgcc gtgctacacc gtgttacttc

661 aacaccggag tgatggtcat cgagttaaag aaatggcgag aaggagatta cacgaggaag 
721 atcatagagt ggatggagtt acaaaaacgg ataagaatct acgagttagg ctctttacca 
781 ccgtttttac ttgtcttcgc cggaaacata gctccggtag atcaccggtg gaaccaacac 
841 ggtttaggag gagataattt tagaggactg tgtcgagatt tgcatccagg tccagtgagt 

901 ttgttgcatt ggagtgggaa agggaagcca tgggtaaggt tagatgatgg tcgaccttgc
961 ccgcttgatg cactttgggt tccatatgat ttgttagagt cacggttcga ccttatcgag 
1021 agttaa 



Amino Acid Sequence of Sequence #23: (SEQ ID NO: 46)
Genebank ID# NP_190645
Positions 1-341. 

1 mhskfilyls ilavftvsfa ggerfkeapk ffnspeclti endedfvcsd kaihvamtld 
61 taylrgsmav ilsvlqhssc pqnivfhfvt skqshrlqny vvasfpylkf riypydvaai

121 sglistsirs aldsplnyar nyladilptc lsrvvyldsd lilvddiskl fsthiptdvv
181 laapeycnan fttyftptfw snpslsitls lnrratpcyf ntgvmvielk kwregdytrk
241 iiewmelqkr iriyelgslp pfllvfagni apvdhrwnqh glggdnfrgl crdlhpgpvs 
301 llhwsgkgkp wvrlddgrpc pldalwvpyd llesrfdlie s 


Sequence #24 (SEQ ID NO:47) 
Gene name: At3g62660 

GeneBank accession # for reference: NM_116131, GI:30695642
Nucleotide sequence of Sequence #24:
Positions 1-1086 of CDS of NM_116131.



1 atgctttgga tcatgagatt ctccggttta ttctccgccg ctttggttat catcgtcctc 
61 tctccttctc tccaatcgtt tcctccagct gaagctatca gatcctctca tctcgacgct

121 tacctccgtt tcccctcctc cgatccaccg ccgcatagat tctccttcag aaaagctcct 
181 gttttccgca atgccgccga ttgcgccgcc gcagatatcg attccggcgt ctgtaaccct 
241 tccttggtcc acgtcgcgat tactctcgat ttcgagtacc tgcgtggctc aatcgccgcc 
301 gttcattcga ttctcaagca ctcgtcgtgt cccgagagcg tcttcttcca tttcctcgtc 
361 tccgagactg acctagaatc cttgattcgt tcgacttttc ccgaattgaa attaaaggtt

421 tactacttcg atccggagat tgtacggacg ctgatctcaa cctccgtgag acaagcgctc 
481 gagcagccgt tgaattacgc tagaaattac ctagctgacc ttctcgagcc ttgcgtgcgt 
541 cgcgtgatct acctagattc cgatctaatc gtcgtcgacg acatcgcaaa gctctggatg 
601 acgaaactgg gatcgaaaac gatcggagct cccgagtact gtcacgcgaa cttcacaaag 
661 tatttcacac cggcgttctg gtccgacgag aggttctccg gagctttctc cgggaggaaa
721 ccgtgctact tcaacacggg agtgatggtg atggatctag agagatggag gcgcgtaggg
781 tacacggagg tgatagagaa atggatggag attcagaaga gtgataggat ttacgagctg 
841 ggatcattgc cgccgttctt gttggtgttc gccggagaag tagctccgat agagcatcgg 
901 tggaaccagc atgggcttgg tggagataac gtgagaggaa gctgtagaga tttacatccc 
961 ggtccggtta gcttgcttca ttggtccggt agtggtaaac cgtggtttcg gttagattcg

1021 agacggcctt gtccacttga tactctttgg gcaccttatg atttgtatgg acactactct 

1081 cgctga 

Amino Acid Sequence of Sequence #24: (SEQ ID NO: 48)
Genebank ID# NP_191825 
Positions 1-361. 

1 mlwimrfsgl fsaalviivl spslqsfppa eairsshlda ylrfpssdpp phrfsfrkap 
61 vfrnaadcaa adidsgvcnp slvhvaitld feylrgsiaa vhsilkhssc pesvffhflv

121 setdleslir stfpelklkv yyfdpeivrt listsvrqal eqplnyarny ladllepcvr 
181 rviyldsdli vvddiaklwm tklgsktiga peychanftk yftpafwsde rfsgafsgrk
241 pcyfntgvmv mdlerwrrvg yteviekwme iqksdriyel gslppfllvf agevapiehr 
301 wnqhglggdn vrgscrdlhp gpvsllhwsg sgkpwfrlds rrpcpldtlw apydlyghys 
361 r

Sequence #25 (SEQ ID NO:49) 

Gene name: At4g02130 
GeneBank accession # for reference: NM_116445, GI:18411845
Nucleotide sequence of Sequence #25:
Positions 1-1041 of CDS of NM_116445.

1 atgctttgga taacgagatt tgctggatta ttctccgccg cgatggcagt gatcgtgtta 

61 tctccgtcgc ttcagtcatt tcctccggcg gcggcaatcc gttcttctcc atcaccgatc

121 ttcagaaaag ctccagcggt gttcaacaac ggcgacgaat gtctctcctc cggcggcgtc 
181 tgcaatccgt cgttggtcca cgtggcgatc acgttagacg tagagtacct gcgtggctca 
241 atcgcagccg ttaactcgat ccttcagcac tcggtgtgtc cagagagcgt cttcttccac 
301 ttcatcgccg tctccgagga aacaaacctg ttggagtcgc tggtgagatc ggttttcccg 

361 agactgaaat tcaatattta cgattttgcc cctgagacag ttcgtggttt gatttcttct

421 tccgtgagac aagctctcga gcagcctctg aactacgcta gaagctactt agcggatctg 
481 ctggagcctt gtgttaaccg tgtcatatac ttggattcgg atcttgtcgt cgtcgatgac 
541 atcgctaagc tttggaaaac tagcctaggc tcgaggataa tcggagctcc ggagtattgt 
601 cacgcgaatt tcacgaaata cttcaccgga ggattctggt cggaggagag attctccggt 

661 acctttagag ggaggaagcc atgttacttc aacacaggtg tgatggtgat agatcttaag

721 aaatggagaa gaggtggtta cacgaaacgt atcgagaaat ggatggagat tcagagaaga 
781 gagaggattt acgaactagg ctcgcttcca ccgtttcttc tagttttctc cggtcacgtg 
841 gctcccatct ctcaccggtg gaaccagcat ggacttggtg gtgacaatgt tagaggtagc 
901 tgtcgtgatt tgcatcctgg tcctgtgagt ttgctgcatt ggtctggtag tggcaagccc 

961 tggataagac tcgattccaa acggccttgt cccttagacg cattatggac gccttacgac
1021 ttgtatcgac attcgcattg a

Amino Acid Sequence of Sequence #25: (SEQ ID NO: 50)
Genebank ID# NP_192122
Positions 1-346. 

1 mlwitrfagl fsaamavivl spslqsfppa aairsspspi frkapavfnn gdeclssggv 
61 cnpslvhvai tldveylrgs iaavnsilqh svcpesvffh fiavseetnl leslvrsvfp 
121 rlkfniydfa petvrgliss svrqaleqpl nyarsyladl lepcvnrviy ldsdlvvvdd
181 iaklwktslg sriigapeyc hanftkyftg gfwseerfsg tfrgrkpcyf ntgvmvidlk 
241 kwrrggytkr iekwmeiqrr eriyelgslp pfllvfsghv apishrwnqh glggdnvrgs 
301 crdlhpgpvs llhwsgsgkp wirldskrpc pldalwtpyd lyrhsh 



Those skilled in the art will appreciate that the invention described herein is 
susceptible to variations and modifications other than those specifically described. 
It is to be understood that the invention includes all such variations and 
modifications. The invention also includes all of the steps, features, compositions 
and compounds referred to or indicated in this specification, individually or 
collectively, and any and all combinations of any two or more of said steps or
features. 

The amino acids which occur in the various amino acid sequences referred 
to in the specification have their usual three- and one-letter abbreviations routinely 
used in the art: A, Ala, Alanine; C, Cys, Cysteine; D, Asp, Aspartic Acid; E, Glu, 
Glutamic Acid; F, Phe, Phenylalanine; G, Gly, Glycine; H, His, Histidine; I, Ile,
Isoleucine; K, Lys, Lysine; L, Leu, Leucine; M, Met, Methionine; N, Asn,
Asparagine; P, Pro, Proline; Q, Gln, Glutamine; R, Arg, Arginine; S, Ser, Serine; T,
Thr, Threonine; V, Val, Valine; W, Trp, Tryptophan; Y, Tyr, Tyrosine.

A protein is considered an isolated protein if it is a protein isolated from the 
plant, or from a host cell in which it is recombinantly produced. It can be purified or 
it can simply be free of other proteins and biological materials with which it is 
associated in nature. 

An isolated nucleic acid is a nucleic acid the structure of which is not 
identical to that of any naturally occurring nucleic acid or to that of any fragment of 
a naturally occurring genomic nucleic acid spanning more than three separate 
genes. The term therefore covers, for example, (a) a DNA which has the sequence
of part of a naturally occurring genomic DNA molecule but is not flanked by both of
the coding or noncoding sequences that flank that part of the molecule in the 
genome of the organism in which it naturally occurs; (b) a nucleic acid incorporated 
into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner
such that the resulting molecule is not identical to any naturally occurring vector or
genomic DNA; (c) a separate molecule such as a cDNA, a genomic fragment, a 
fragment produced by polymerase chain reaction (PCR), or a restriction fragment; 
and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a 
gene encoding a fusion protein. Specifically excluded from this definition are
nucleic acids present in mixtures of (i) DNA molecules, (ii) transformed or
transfected cells, and (iii) cell clones, e.g., as these occur in a DNA library such as 
a cDNA or genomic DNA library. 

As used herein, expression directed by a particular sequence is the
transcription of an associated downstream sequence. If appropriate and desired
for the associated sequence, the term expression also encompasses
translation (protein synthesis) of the transcribed RNA. When expression of a 
sequence of interest is "up-regulated," the expression is increased. With reference 
to up-regulation of expression of a sequence of interest operably linked to a 
transcription regulatory sequence, expression is increased.

In the present context, a promoter is a DNA region which includes 
sequences sufficient to cause transcription of an associated (downstream) 
sequence. The promoter may be regulated, i.e., not constitutively acting to cause
transcription of the associated sequence. If inducible, there are sequences present
which mediate regulation of expression so that the associated sequence is 
transcribed only when an inducer molecule is present in the medium in or on which 
the organism is cultivated. In the present context, a transcription regulatory 
sequence includes a promoter sequence and can further include cis-active
sequences for regulated expression of an associated sequence in response to
environmental signals. 

One DNA portion or sequence is downstream of a second DNA portion or
sequence when it is located 3' of the second sequence. One DNA portion or
sequence is upstream of a second DNA portion or sequence when it is located 5' of
that sequence.

One DNA molecule or sequence and another are heterologous to one another if
the two are not derived from the same ultimate natural source. The sequences
may be natural sequences, or at least one sequence can be designed by man, as 
in the case of a multiple cloning site region. The two sequences can be derived 
from two different species or one sequence can be produced by chemical synthesis 
provided that the nucleotide sequence of the synthesized portion was not derived 
from the same organism as the other sequence.

An isolated or substantially pure nucleic acid molecule or polynucleotide is a 
polynucleotide which is substantially separated from other polynucleotide 
sequences which naturally accompany a native transcription regulatory sequence. 
The term embraces a polynucleotide sequence which has been removed from its
naturally occurring environment, and includes recombinant or cloned DNA isolates, 
chemically synthesized analogues and analogues biologically synthesized by 
heterologous systems. 

A polynucleotide is said to encode a polypeptide if, in its native state or when
manipulated by methods known to those skilled in the art, it can be transcribed
and/or translated to produce the polypeptide or a fragment thereof. The anti-sense 
strand of such a polynucleotide is also said to encode the sequence. 
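
By way of illustration only, the relationship between a coding strand and its anti-sense strand can be sketched in Python. The function name reverse_complement is hypothetical and is not part of the disclosure; the input is the first ten bases of the Sequence #25 coding sequence given above.

```python
# Illustrative sketch only; reverse_complement is a hypothetical helper name.
_COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, ignoring whitespace."""
    compact = "".join(seq.split())
    return compact.translate(_COMPLEMENT)[::-1]


sense = "atgctttgga"  # first ten bases of the Sequence #25 CDS
antisense = reverse_complement(sense)
print(antisense)
```

Applying the function twice returns the original strand, which reflects the fact that either strand fully determines the other.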

A nucleotide sequence is operably linked when it is placed into a functional
relationship with another nucleotide sequence. For instance, a promoter is
operably linked to a coding sequence if the promoter effects its transcription or 
expression. Generally, operably linked means that the sequences being linked are 
contiguous and, where necessary to join two protein coding regions, contiguous
and in reading frame. However, it is well known that certain genetic elements, such
as enhancers, may be operably linked even at a distance, i.e., even if not 
contiguous.

The term recombinant polynucleotide refers to a polynucleotide which is
made by the combination of two otherwise separated segments of sequence 
accomplished by the artificial manipulation of isolated segments of polynucleotides 
by genetic engineering techniques or by chemical synthesis. In so doing one may 
join together polynucleotide segments of desired functions to generate a desired
combination of functions. 

Polynucleotide probes include an isolated polynucleotide attached to a label 
or reporter molecule and may be used to identify and isolate other sequences, for 
example, those from other species or other strains. Probes comprising synthetic
oligonucleotides or other polynucleotides may be derived from naturally occurring
or recombinant single or double stranded nucleic acids or be chemically 
synthesized. Polynucleotide probes may be labeled by any of the methods known 
in the art, e.g., random hexamer labeling, nick translation, or the Klenow fill-in 
reaction.

Large amounts of the polynucleotides may be produced by replication in a 
suitable host cell. Natural or synthetic DNA fragments coding for a protein of 
interest are incorporated into recombinant polynucleotide constructs, typically DNA
constructs, capable of introduction into and replication in a prokaryotic or eukaryotic
cell. Usually the construct is suitable for replication in a unicellular host, such as A.
pullulans or a bacterium, but a multicellular eukaryotic host may also be
appropriate, with or without integration within the genome of the host cell. 
Commonly used prokaryotic hosts include strains of Escherichia coli, although other
prokaryotes, such as Bacillus subtilis or a pseudomonad, may also be used.
Eukaryotic host cells include yeast, filamentous fungi, plant, insect, amphibian, 
mammalian and avian species. Such factors as ease of manipulation, ability to 
appropriately glycosylate expressed proteins, degree and control of protein 
expression, ease of purification of expressed proteins away from cellular
contaminants or other factors influence the choice of the host cell.

The polynucleotides may also be produced by chemical synthesis, e.g., by 
the phosphoramidite method described by Beaucage and Caruthers (1981) Tetra. 
Letts., 22: 1859-1862 or the triester method according to Matteucci et al. (1981) J.
Am. Chem. Soc., 103:3185, and may be performed on commercial automated
oligonucleotide synthesizers. A double-stranded fragment may be obtained from 
the single stranded product of chemical synthesis either by synthesizing the 
complementary strand and annealing the strands together under appropriate
conditions or by adding the complementary strand using DNA polymerase with an
appropriate primer sequence. 

DNA constructs prepared for introduction into a prokaryotic or eukaryotic 
host will typically comprise a replication system (i.e. vector) recognized by the host,
including the intended DNA fragment encoding the desired polypeptide, and will
preferably also include transcription and translational initiation regulatory 
sequences operably linked to the polypeptide-encoding segment. Expression 
systems (expression vectors) may include, for example, an origin of replication or 
autonomously replicating sequence (ARS) and expression control sequences, a
promoter, an enhancer and necessary processing information sites, such as
ribosome-binding sites, RNA splice sites, polyadenylation sites, transcriptional 
terminator sequences, and mRNA stabilizing sequences. Signal peptides may also 
be included where appropriate from secreted polypeptides of the same or related 
species, which allow the protein to cross and/or lodge in cell membranes or be
secreted from the cell.

An appropriate promoter and other necessary vector sequences will be 
selected so as to be functional in the host. Examples of workable combinations of 
cell lines and expression vectors are described in Sambrook et al. (1989) vide infra;
Ausubel et al. (Eds.) (1995) Current Protocols in Molecular Biology, Greene
Publishing and Wiley Interscience, New York; and Metzger et al. (1988) Nature, 
334: 31-36. Many useful vectors for expression in bacteria, yeast, fungal, 
mammalian, insect, plant or other cells are well known in the art and may be 
obtained from such vendors as Stratagene, New England Biolabs, Promega Biotech,
and others. In addition, the construct may be joined to an amplifiable gene (e.g.,
DHFR) so that multiple copies of the gene may be made. For appropriate enhancer 
and other expression control sequences, see also Enhancers and Eukaryotic Gene 
Expression, Cold Spring Harbor Press, N.Y. (1983). While such expression vectors
may replicate autonomously, they may less preferably replicate by being inserted
into the genome of the host cell. 

Expression and cloning vectors will likely contain a selectable marker, that is, 
a gene encoding a protein necessary for the survival or growth of a host cell
transformed with the vector. Although such a marker gene may be carried on 
another polynucleotide sequence co-introduced into the host cell, it is most often 
contained on the cloning vector. Only those host cells into which the marker gene 
has been introduced will survive and/or grow under selective conditions. Typical 
selection genes encode proteins that (a) confer resistance to antibiotics or other
toxic substances, e.g., ampicillin, neomycin, methotrexate, etc.; (b) complement 
auxotrophic deficiencies; or (c) supply critical nutrients not available from complex 
media. The choice of the proper selectable marker will depend on the host cell; 
appropriate markers for different hosts are known in the art. 


Recombinant host cells, in the present context, are those which have been 
genetically modified to contain an isolated DNA molecule of the instant invention. 
The DNA can be introduced by any means known to the art which is appropriate for 
the particular type of cell, including without limitation, transformation, lipofection or 
electroporation.

It is recognized by those skilled in the art that the DNA sequences may vary 
due to the degeneracy of the genetic code and codon usage. All DNA sequences 
which code for the polypeptide or protein of interest are included in this invention. 
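
As a non-limiting illustration of this degeneracy, the following Python sketch translates two different DNA sequences with the standard genetic code and shows that they encode the same peptide. The first sequence is the first 30 bases of the Sequence #25 coding sequence; the second is a hypothetical variant, constructed for this example only, in which two codons are replaced by synonymous codons. The names CODON_TABLE and translate are likewise assumptions of the sketch.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (whitespace ignored) in reading frame 1."""
    compact = "".join(dna.split()).lower()
    usable = len(compact) - len(compact) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[compact[i:i + 3]] for i in range(0, usable, 3))


original = "atgctttgga taacgagatt tgctggatta"   # Sequence #25, positions 1-30
# Hypothetical variant: ctt -> ctg (Leu) and aga -> cgt (Arg).
variant = "atgctgtgga taacgcgttt tgctggatta"

print(translate(original))
print(translate(variant))
```

Both calls yield the same ten residues, which correspond to the start of the amino acid sequence listed for Sequence #25.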


Additionally, it will be recognized by those skilled in the art that allelic 
variations may occur in the DNA sequences which will not significantly change 
activity of the amino acid sequences of the peptides which the DNA sequences 
encode. All such equivalent DNA sequences are included within the scope of this 
invention and the definition of the regulated promoter region. The skilled artisan
will understand that the sequence of the exemplified sequence can be used to 
identify and isolate additional, nonexemplified nucleotide sequences which are 
functionally equivalent to the sequences given.

Mutational, insertional, and deletional variants of the disclosed nucleotide
sequences can be readily prepared by methods which are well known to those 
skilled in the art. These variants can be used in the same manner as the 
exemplified primer sequences so long as the variants have substantial sequence 
homology with the original sequence. As used herein, substantial sequence
homology refers to homology which is sufficient to enable the variant polynucleotide 
to function in the same capacity as the polynucleotide from which the probe was 
derived. Preferably, this homology is greater than 80%, more preferably, this 
homology is greater than 85%, even more preferably this homology is greater than
90%, and most preferably, this homology is greater than 95%. The degree of
homology or identity needed for the variant to function in its intended capacity 
depends upon the intended use of the sequence. It is well within the skill of a 
person trained in this art to make mutational, insertional, and deletional mutations 
which are equivalent in function or are designed to improve the function of the
sequence or otherwise provide a methodological advantage.

Polymerase Chain Reaction (PCR) is a repetitive, enzymatic, primed
synthesis of a nucleic acid sequence. This procedure is well known and commonly 
used by those skilled in this art [see Mullis, U.S. Patent Nos. 4,683,195, 4,683,202,
and 4,800,159; Saiki et al. (1985) Science 230:1350-1354]. PCR is based on the
enzymatic amplification of a DNA fragment of interest that is flanked by two 
oligonucleotide primers that hybridize to opposite strands of the target sequence. 
The primers are oriented with the 3' ends pointing towards each other. Repeated 
cycles of heat denaturation of the template, annealing of the primers to their
complementary sequences, and extension of the annealed primers with a DNA
polymerase result in the amplification of the segment defined by the 5' ends of the 
PCR primers. Since the extension product of each primer can serve as a template 
for the other primer, each cycle essentially doubles the amount of DNA template 
produced in the previous cycle. This results in the exponential accumulation of the
specific target fragment, up to several million-fold in a few hours. By using a
thermostable DNA polymerase such as the Taq polymerase, which is isolated from
the thermophilic bacterium Thermus aquaticus, the amplification process can be 
completely automated. Other enzymes which can be used are known to those 
skilled in the art. 
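
The doubling described above can be put in numbers. The following Python sketch is an idealized calculation only: it assumes a fixed per-cycle efficiency (1.0 meaning perfect doubling), which real reactions do not maintain indefinitely, and the function names and the efficiency parameter are hypothetical.

```python
import math


def fold_amplification(cycles: int, efficiency: float = 1.0) -> float:
    """Idealized fold increase in target after a number of PCR cycles.

    efficiency is the assumed fraction of templates copied per cycle;
    1.0 corresponds to perfect doubling.
    """
    return (1.0 + efficiency) ** cycles


def cycles_for_fold(target_fold: float, efficiency: float = 1.0) -> int:
    """Smallest whole number of cycles giving at least target_fold amplification."""
    return math.ceil(math.log(target_fold) / math.log(1.0 + efficiency))


print(fold_amplification(20))      # fold increase after 20 ideal cycles
print(cycles_for_fold(1_000_000))  # ideal cycles needed for a million-fold increase
```

Under the perfect-doubling assumption, roughly 20 cycles already give a million-fold increase, and each additional cycle doubles that figure.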

It is well known in the art that the polynucleotide sequences of the present
invention can be truncated and/or mutated such that certain of the resulting 
fragments and/or mutants of the original full-length sequence can retain the desired 
characteristics of the full-length sequence. A wide variety of restriction enzymes
which are suitable for generating fragments from larger nucleic acid molecules are
well known. In addition, it is well known that Bal31 exonuclease can be
conveniently used for time-controlled limited digestion of DNA. See, for example, 
Maniatis (1982) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, New York, pages 135-139, incorporated herein by reference. See also
Wei et al. (1983) J. Biol. Chem. 258:13006-13512. By use of Bal31 exonuclease
(commonly referred to as "erase-a-base" procedures), the ordinarily skilled artisan 
can remove nucleotides from either or both ends of the subject nucleic acids to 
generate a wide spectrum of fragments which are functionally equivalent to the 
subject nucleotide sequences. One of ordinary skill in the art can, in this manner,
generate hundreds of fragments of controlled, varying lengths from locations all
along the original molecule. The ordinarily skilled artisan can routinely test or 
screen the generated fragments for their characteristics and determine the utility of 
the fragments as taught herein. It is also well known that the mutant sequences of 
the full length sequence, or fragments thereof, can be easily produced with site
directed mutagenesis. See, for example, Larionov, O.A. and Nikiforov, V.G. (1982)
Genetika 18(3):349-59; Shortle, D, DiMaio, D., and Nathans, D. (1981) Annu. Rev. 
Genet. 15:265-94; both incorporated herein by reference. The skilled artisan can 
routinely produce deletion-, insertion-, or substitution-type mutations and identify 
those resulting mutants which contain the desired characteristics of the full length
wild-type sequence, or fragments thereof, i.e., those which retain promoter activity
and also provide transcription of downstream sequence. 

Following the teachings herein and using knowledge and techniques well 
known in the art, the skilled worker will be able to make a large number of operative 
embodiments having equivalent DNA sequences to those listed herein without the
expense of undue experimentation. 

As used herein percent sequence identity of two nucleic acids is determined 
using the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA
87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA
90:5873-5877. Such an algorithm is incorporated into the NBLAST and XBLAST
programs of Altschul et al. (1990) J. Mol. Biol. 215:402-410. BLAST nucleotide
searches are performed with the NBLAST program, score = 100, wordlength = 12,
to obtain nucleotide sequences with the desired percent sequence identity. To
obtain gapped alignments for comparison purposes, Gapped BLAST is used as 
described in Altschul et al. (1997) Nucl. Acids. Res. 25:3389-3402. When utilizing 
BLAST and Gapped BLAST programs, the default parameters of the respective 
programs (NBLAST and XBLAST) are used. See, for example, the National Center 
for Biotechnology Information website on the internet.
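
For orientation only, percent identity over a pair of sequences that have already been aligned can be computed as sketched below in Python. This is not the Karlin and Altschul algorithm or any BLAST program; it merely counts matching positions in two equal-length aligned strings, skipping columns that contain a gap character. The function name percent_identity is hypothetical, and the two 30-nucleotide inputs are corresponding stretches of the Sequence #24 and Sequence #25 coding sequences listed above.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical positions between two pre-aligned sequences.

    Whitespace is ignored; columns where either sequence has a gap are skipped.
    """
    a = "".join(aligned_a.split()).lower()
    b = "".join(aligned_b.split()).lower()
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


seq24_part = "gagcagccgt tgaattacgc tagaaattac"  # from Sequence #24
seq25_part = "gagcagcctc tgaactacgc tagaagctac"  # from Sequence #25

print(round(percent_identity(seq24_part, seq25_part), 2))
```

A value obtained this way depends entirely on the alignment supplied, so it is not interchangeable with an identity figure reported by BLAST.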

Techniques and agents for introducing and selecting for the presence of 
heterologous DNA in plant cells and/or tissue are well-known. Genetic markers 
allowing for the selection of heterologous DNA in plant cells are well-known, e.g.,
genes carrying resistance to an antibiotic such as kanamycin, hygromycin,
gentamicin, or bleomycin. The marker allows for selection of successfully 
transformed plant cells growing in the medium containing the appropriate antibiotic 
because they will carry the corresponding resistance gene. In most cases the 
heterologous DNA which is inserted into plant cells contains a gene which encodes
a selectable marker such as an antibiotic resistance marker, but this is not
mandatory. An exemplary drug resistance marker is the gene whose expression 
results in kanamycin resistance, i.e., the chimeric gene containing nopaline 
synthetase promoter, Tn5 neomycin phosphotransferase II and nopaline 
synthetase 3' non-translated region described by Rogers et al., Methods for Plant
Molecular Biology, A. Weissbach and H. Weissbach, eds., Academic Press, Inc.,
San Diego, CA (1988).

For genetically engineering plant cells and/or tissue, an expression cassette
comprising an inducible promoter or chimeric promoter fused to a heterologous
coding sequence, including possibly an antisense DNA construct and/or a DNA
construct designed to elicit double-stranded RNA-mediated gene silencing, followed
by a transcription termination sequence, is introduced into the plant cell or tissue
by Agrobacterium-mediated transformation, electroporation, microinjection,
particle bombardment or other techniques known to the art. The
expression cassette advantageously further contains a marker allowing selection of
the heterologous DNA in the plant cell, e.g., a gene carrying resistance to an 
antibiotic such as kanamycin, hygromycin, gentamicin, or bleomycin. 

A DNA construct carrying a plant-expressible gene or other DNA of interest 
can be inserted into the genome of a plant by any suitable method. Such methods 
may involve, for example, the use of liposomes, electroporation, diffusion, particle 
bombardment, microinjection, gene gun, chemicals that increase free DNA uptake, 
e.g., calcium phosphate coprecipitation, viral vectors, and other techniques 
practiced in the art. Suitable plant transformation vectors include those derived 
from a Ti plasmid of Agrobacterium tumefaciens, such as those disclosed by 
Herrera-Estrella (1983), Bevan (1983), Klee (1985) and EPO publication 120,516 
(Schilperoort et al.). In addition to plant transformation vectors derived from the Ti
or root-inducing (Ri) plasmids of Agrobacterium, alternative methods can be used 
to insert the DNA constructs of this invention into plant cells. 

The choice of vector in which the DNA of interest is operatively linked 
depends directly, as is well known in the art, on the functional properties desired, 
e.g., replication, protein expression, and the host cell to be transformed, these 
being limitations inherent in the art of constructing recombinant DNA molecules. 
The vector desirably includes a prokaryotic replicon, i.e., a DNA sequence having 
the ability to direct autonomous replication and maintenance of the recombinant 
DNA molecule extra-chromosomally when introduced into a prokaryotic host cell, 
such as a bacterial host cell. Such replicons are well known in the art. In addition, 
preferred embodiments that include a prokaryotic replicon also include a gene 
whose expression confers a selective advantage, such as a drug resistance, to the 
bacterial host cell when introduced into those transformed cells. 

Typical bacterial drug resistance genes are those that confer resistance to 
ampicillin or tetracycline, among other selective agents. The neomycin 
phosphotransferase gene has the advantage that it is expressed in eukaryotic as 
well as prokaryotic cells.

Those vectors that include a prokaryotic replicon also typically include
convenient restriction sites for insertion of a recombinant DNA molecule of the 
present invention. Typical of such vector plasmids are pUC8, pUC9, pBR322, and 
pBR329 available from BioRad Laboratories (Richmond, CA) and pPL, pK and 
K223 available from Pharmacia (Piscataway, NJ), and pBLUESCRIPT and pBS 
available from Stratagene (La Jolla, CA). A vector of the present invention may 
also be a Lambda phage vector including those Lambda vectors described in 
Molecular Cloning: A Laboratory Manual, Second Edition, Maniatis et al., eds.,
Cold Spring Harbor Press (1989) and the Lambda ZAP vectors available from 
Stratagene (La Jolla, CA). Other exemplary vectors include pCMU [Nilsson et al. 
(1989) Cell 58:707]. Other appropriate vectors may also be synthesized, according 
to known methods; for example, vectors pCMU/Kb and pCMUII used in various 
applications herein are modifications of pCMUIV [Nilsson, (1989) supra]. 

Typical expression vectors capable of expressing a recombinant nucleic acid 
sequence in plant cells and capable of directing stable integration within the host 
plant cell include vectors derived from the tumor-inducing (Ti) plasmid of 
Agrobacterium tumefaciens described by Rogers et al. (1987) Meth. in Enzymol. 
153:253-277, and several other expression vector systems known to function in 
plants. See for example, Verma et al., No. WO87/00551; Cocking and Davey 
(1987) Science 236:1259-1262. 

A transgenic plant can be produced by any means known to the art, 
including but not limited to Agrobacterium tumefaciens-mediated DNA transfer,
preferably with a disarmed T-DNA vector, electroporation, direct DNA transfer, and 
particle bombardment [See Davey et al. (1989) Plant Mol. Biol. 13:275; Walden and 
Schell (1990) Eur. J. Biochem. 192:563; Joersbo and Burnstedt (1991) Physiol. 
Plant. 81:256; Potrykus (1991) Annu. Rev. Plant Physiol. Plant Mol. Biol. 42:205; 
Gasser and Fraley (1989) Science 244:1293; Leemans (1993) Bio/Technology 
11:522; Beck et al. (1993) Bio/Technology 11:1524; Koziel et al. (1993) 
Bio/Technology 11:194; Vasil et al. (1993) Bio/Technology 11:1533 and Gelvin, 
S.B. (1999) Curr. Opin. Biotech. 9:227-232]. Techniques are well-known to the art 
for the introduction of DNA into monocots as well as dicots, as are the techniques 
for culturing such plant tissues and regenerating those tissues.

Standard techniques for cloning, DNA isolation, amplification and
purification, for enzymatic reactions involving DNA ligase, DNA polymerase, 
restriction endonucleases and the like, and various separation techniques are those 
known and commonly employed by those skilled in the art. A number of standard
techniques are described in Sambrook et al. (1989) Molecular Cloning, Second
Edition, Cold Spring Harbor Laboratory, Plainview, New York; Maniatis et al. (1982)
Molecular Cloning, Cold Spring Harbor Laboratory, Plainview, New York; Wu (ed.) 
(1993) Meth. Enzymol. 218, Part I; Wu (ed.) (1979) Meth. Enzymol. 68; Wu et al. 
(eds.) (1983) Meth. Enzymol. 100 and 101; Grossman and Moldave (eds.) Meth.
Enzymol. 65; Miller (ed.) (1972) Experiments in Molecular Genetics, Cold Spring
Harbor Laboratory, Cold Spring Harbor, New York; Old and Primrose (1981)
Principles of Gene Manipulation, University of California Press, Berkeley; Schleif and
Wensink (1982) Practical Methods in Molecular Biology; Glover (ed.) (1985) DNA
Cloning Vol. I and II, IRL Press, Oxford, UK; Hames and Higgins (eds.) (1985)
Nucleic Acid Hybridization, IRL Press, Oxford, UK; Setlow and Hollaender (1979)
Genetic Engineering: Principles and Methods, Vols. 1-4, Plenum Press, New York; 
and Ausubel et al. (1992) Current Protocols in Molecular Biology, Greene/Wiley, 
New York, NY. Abbreviations and nomenclature, where employed, are deemed 
standard in the field and commonly used in professional journals such as those
cited herein.

All references cited in the present application are incorporated in their 
entirety herein by reference to the extent not inconsistent herewith.

CLAIMS

We claim: 

1. An isolated nucleic acid encoding a polypeptide or a fragment thereof having
galacturonosyltransferase (GalAT) activity. 

2. The nucleic acid of claim 1 wherein the polypeptide or the fragment has 
approximately 50% amino acid sequence similarity with the corresponding 
sequence as set forth in SEQ ID NO: 2. 

3. The nucleic acid of claim 2 wherein the amino acid molecule is selected from 
the group consisting of the sequences as set forth in SEQ ID NOs: 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 
and 50. 

4. The nucleic acid of claim 3 wherein the polypeptide comprises the amino 
acid sequence as set forth in SEQ ID NO: 2. 

5. The nucleic acid of claim 4 wherein the polypeptide is encoded by the 
nucleic acid sequence as set forth in SEQ ID NO: 1.

6. An isolated polypeptide or a fragment thereof having 
galacturonosyltransferase GalAT activity wherein the polypeptide or the 
fragment has approximately 50% amino acid sequence similarity with the 
corresponding amino acid sequence as shown in SEQ ID NO: 2. 

7. The polypeptide or the fragment of claim 6 which comprises the amino acid 
sequence selected from the group consisting of the sequences as set forth in 
SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 48, and 50, or the corresponding sequence thereto. 

8. The polypeptide or the fragment of claim 7 which comprises the amino acid 
sequence as set forth in SEQ ID NO: 2 or the corresponding sequence
thereto.

9. The polypeptide or the fragment of claim 7 wherein the amino acid sequence 
is encoded by the nucleic acid as set forth in SEQ ID NO: 1.

10. An antibody which specifically recognizes the polypeptide or the fragment of 
claims 7 or 8. 

11. An expression vector comprising in operable linkage the nucleic acid 
according to any one of claims 1, 2, 3 or 5 and a plant-expressible promoter.

12. The expression vector of claim 11 wherein said promoter is heterologous to 
said nucleic acid. 

13. A transgenic plant which has been transformed with the expression vector of 
claims 11 or 12. 

14. A transgenic plant having modified pectin. 

15. A transgenic plant having altered GalAT activity wherein the altered activity 
is due to a mutation in the GALAT gene.

16. Progeny of the transgenic plant of claims 13, 14 or 15.

17. Modified pectin isolated from the transgenic plant of claims 14 or 15. 

18. A product comprising the modified pectin of claim 17. 

19. A method of generating a plant with altered GalAT activity by mutating the 
GALAT gene.

20. A method of preparing a polymer comprising a galacturonic acid and a
polymer with a GALAT protein under conditions suitable to form at least one 
covalent linkage between the galacturonic acid and the polymer. 

21. The method of claim 20 wherein said polymer is selected from the group 
consisting of homogalacturonan, rhamnogalacturonan I,
rhamnogalacturonan II, xylogalacturonan, apiogalacturonan or other 
galacturonic containing polymer. 

22. The method of claim 21 wherein said polymer is homogalacturonan.

23. The method of claims 20 or 21 wherein the GALAT protein comprises the 
amino acid sequence as set forth in SEQ ID NO: 2 or a fragment thereof 
having GalAT activity.

SEQUENCE LISTING

<110> University of Georgia Research Foundation, Inc.
Mohnen, Debra
Hahn, Michael G.
Kolli, Venkata S.K.
Doong, Ron L.
Sterling, Jason D.

<120> Galacturonosyltransferases, nucleic acids encoding same, and uses
therefor

<130> 14-03 WO 

<140> Not assigned 
<141> 2004-02-05 

<150> US 60/445,539 
<151> 2003-02-06 

<160> 50 

<170> PatentIn version 3.2

<210> 1 
<211> 2022 
<212> DNA 

<213> Arabidopsis thaliana

<400> 1

atggcgctaa agcgagggct atctggagtt aaccggatta gaggaagtgg tggtggatct 60 
cgatctgtgc ttgtgcttct catatttttc tgtgtttttg cacctctttg cttctttgtt 120 
ggccgaggag tgtatatcga ttcctcaaat gattattcaa ttgtttctgt gaagcagaat 180 
cttgactgga gagaacgttt agcaatgcaa tctgttagat ctcttttctc gaaagagata 240 
ctagatgtta tagcaaccag cacagctgat ttgggtcctc ttagccttga ttcttttaag 300
aaaaacaatt tgtctgcatc atggcgggga accggagtag acccctcctt tagacattct 360
gagaatccag caactcctga tgtcaaatct aataacctga atgaaaaacg tgacagcatt 420
tcaaaagata gtatccatca gaaagttgag acacctacaa agattcacag aaggcaacta 480
agagagaaaa ggcgtgagat gcgggcaaat gagttagttc agcacaatga tgacacgatt 540
ttgaaactcg aaaatgctgc cattgaacgc tctaagtctg ttgattctgc agtccttggt 600
aaatacagta tttggagaag agaaaatgag aatgacaact ctgattcaaa tatacgcttg 660

atgcgggatc aagtaataat ggctagagtc tatagtggga ttgcaaaatt gaaaaacaag 720 

aacgatttgt tacaagaact ccaggcccga cttaaggaca gccaacgggt tttgggggaa 780 

gcaacatctg atgctgatct tcctcggagt gcgcatgaga aactcagagc catgggtcaa 840 

gtcttggcta aagctaagat gcagttatat gactgcaagc tggttactgg aaagctgaga 900

gcaatgcttc agactgccga cgaacaagtg aggagcttaa agaagcagag tacttttctg 960

gctcagttag cagcaaaaac cattccaaat cctatccatt gcctatcaat gcgcttgact 1020

atcgattact atcttctgtc tccggagaaa agaaaattcc ctcggagtga aaacctagaa 1080

aaccctaatc tttatcatta tgccctcttt tccgacaatg tattagctgc atcagtagtt 1140 

gttaactcaa ccatcatgaa tgccaaggat ccttctaagc atgtttttca ccttgtcacg 1200 

gataaactca atttcggagc aatgaacatg tggttcctcc taaacccacc cggaaaggca 1260 

accatacatg tggaaaacgt cgatgagttt aagtggctca attcatctta ctgtcctgtc 1320 

cttcgtcagc ttgaatctgc agcaatgaga gagtactatt ttaaagcaga ccatccaact 1380 

tcaggctctt cgaatctaaa atacagaaac ccaaagtatc tatccatgtt gaatcacttg 1440 

agattctacc tccctgaggt ttatcccaag ctgaacaaaa tcctcttcct ggacgatgac 1500 

atcattgttc agaaagactt gactccactc tgggaagtta acctgaacgg caaagtcaac 1560 

ggtgcagtcg aaacctgtgg ggaaagtttc cacagattcg acaagtatct caacttttcg 1620 

aatcctcaca ttgcgaggaa cttcaatcca aatgcttgtg gatgggctta tggaatgaac 1680 

atgttcgacc taaaggaatg gaagaagaga gacatcactg gtatatacca caagtggcaa 1740 

aacatgaatg agaacaggac actatggaag ctagggacat tgccaccagg attaataaca 1800 

ttctacggat taacacatcc cttaaacaag gcgtggcatg tgctgggact tggatataac 1860 

ccgagtatcg acaagaagga cattgagaat gcagcagtgg ttcactataa cgggaacatg 1920 

aaaccatggt tggagttggc aatgtccaaa tatcggccgt attggaccaa gtacatcaag 1980 

tttgatcacc catatcttcg tcgttgcaac cttcatgaat aa 2022 

<210> 2 

<211> 673 

<212> PRT 

<213> Arabidopsis thaliana

<400> 2 

Met Ala Leu Lys Arg Gly Leu Ser Gly Val Asn Arg Ile Arg Gly Ser
1 5 10 15

Gly Gly Gly Ser Arg Ser Val Leu Val Leu Leu Ile Phe Phe Cys Val
20 25 30

Phe Ala Pro Leu Cys Phe Phe Val Gly Arg Gly Val Tyr Ile Asp Ser
35 40 45

Ser Asn Asp Tyr Ser Ile Val Ser Val Lys Gln Asn Leu Asp Trp Arg
50 55 60

Glu Arg Leu Ala Met Gln Ser Val Arg Ser Leu Phe Ser Lys Glu Ile
65 70 75 80

Leu Asp Val Ile Ala Thr Ser Thr Ala Asp Leu Gly Pro Leu Ser Leu
85 90 95

Asp Ser Phe Lys Lys Asn Asn Leu Ser Ala Ser Trp Arg Gly Thr Gly
100 105 110

Val Asp Pro Ser Phe Arg His Ser Glu Asn Pro Ala Thr Pro Asp Val
115 120 125

Lys Ser Asn Asn Leu Asn Glu Lys Arg Asp Ser Ile Ser Lys Asp Ser
130 135 140

Ile His Gln Lys Val Glu Thr Pro Thr Lys Ile His Arg Arg Gln Leu
145 150 155 160

Arg Glu Lys Arg Arg Glu Met Arg Ala Asn Glu Leu Val Gln His Asn
165 170 175

Asp Asp Thr Ile Leu Lys Leu Glu Asn Ala Ala Ile Glu Arg Ser Lys
180 185 190

Ser Val Asp Ser Ala Val Leu Gly Lys Tyr Ser Ile Trp Arg Arg Glu
195 200 205

Asn Glu Asn Asp Asn Ser Asp Ser Asn Ile Arg Leu Met Arg Asp Gln
210 215 220

Val Ile Met Ala Arg Val Tyr Ser Gly Ile Ala Lys Leu Lys Asn Lys
225 230 235 240

Asn Asp Leu Leu Gln Glu Leu Gln Ala Arg Leu Lys Asp Ser Gln Arg
245 250 255

Val Leu Gly Glu Ala Thr Ser Asp Ala Asp Leu Pro Arg Ser Ala His
260 265 270

Glu Lys Leu Arg Ala Met Gly Gln Val Leu Ala Lys Ala Lys Met Gln
275 280 285

Leu Tyr Asp Cys Lys Leu Val Thr Gly Lys Leu Arg Ala Met Leu Gln
290 295 300

Thr Ala Asp Glu Gln Val Arg Ser Leu Lys Lys Gln Ser Thr Phe Leu
305 310 315 320 

Ala Gln Leu Ala Ala Lys Thr Ile Pro Asn Pro Ile His Cys Leu Ser
325 330 335

Met Arg Leu Thr Ile Asp Tyr Tyr Leu Leu Ser Pro Glu Lys Arg Lys
340 345 350

Phe Pro Arg Ser Glu Asn Leu Glu Asn Pro Asn Leu Tyr His Tyr Ala
355 360 365

Leu Phe Ser Asp Asn Val Leu Ala Ala Ser Val Val Val Asn Ser Thr
370 375 380

Ile Met Asn Ala Lys Asp Pro Ser Lys His Val Phe His Leu Val Thr
385 390 395 400

Asp Lys Leu Asn Phe Gly Ala Met Asn Met Trp Phe Leu Leu Asn Pro
405 410 415

Pro Gly Lys Ala Thr Ile His Val Glu Asn Val Asp Glu Phe Lys Trp
420 425 430

Leu Asn Ser Ser Tyr Cys Pro Val Leu Arg Gln Leu Glu Ser Ala Ala
435 440 445

Met Arg Glu Tyr Tyr Phe Lys Ala Asp His Pro Thr Ser Gly Ser Ser
450 455 460

Asn Leu Lys Tyr Arg Asn Pro Lys Tyr Leu Ser Met Leu Asn His Leu
465 470 475 480

Arg Phe Tyr Leu Pro Glu Val Tyr Pro Lys Leu Asn Lys Ile Leu Phe
485 490 495

Leu Asp Asp Asp Ile Ile Val Gln Lys Asp Leu Thr Pro Leu Trp Glu
500 505 510

Val Asn Leu Asn Gly Lys Val Asn Gly Ala Val Glu Thr Cys Gly Glu
515 520 525

Ser Phe His Arg Phe Asp Lys Tyr Leu Asn Phe Ser Asn Pro His Ile
530 535 540

Ala Arg Asn Phe Asn Pro Asn Ala Cys Gly Trp Ala Tyr Gly Met Asn
545 550 555 560

Met Phe Asp Leu Lys Glu Trp Lys Lys Arg Asp Ile Thr Gly Ile Tyr
565 570 575

His Lys Trp Gln Asn Met Asn Glu Asn Arg Thr Leu Trp Lys Leu Gly
580 585 590

Thr Leu Pro Pro Gly Leu Ile Thr Phe Tyr Gly Leu Thr His Pro Leu
595 600 605

Asn Lys Ala Trp His Val Leu Gly Leu Gly Tyr Asn Pro Ser Ile Asp
610 615 620

Lys Lys Asp Ile Glu Asn Ala Ala Val Val His Tyr Asn Gly Asn Met
625 630 635 640

Lys Pro Trp Leu Glu Leu Ala Met Ser Lys Tyr Arg Pro Tyr Trp Thr
645 650 655

Lys Tyr Ile Lys Phe Asp His Pro Tyr Leu Arg Arg Cys Asn Leu His
660 665 670 

Glu 



<210> 3 

<211> 1860 

<212> DNA 

<213> Arabidopsis thaliana



<400> 3

atgaaaggcg gaggcggtgg tggaggaggt ggtggcggag gaaaacgccg gtggaaagtt 60

ctggtgattg gagttttggt tcttgttatt ctttctatgc ttgttcctct tgctttctta 120

ctcggtcttc acaatggctt tcactctcct ggatttgtca ctgttcaacc ggcttcttca 180

tttgagagct ttaccagaat caatgctact aagcatacac agagagatgt atccgaacgg 240

gtcgatgagg ttcttcaaaa aatcaatcca gttcttccca agaaaagcga cataaacgtg 300

ggttccagag atgtgaatgc aacaagcggc actgattcta aaaaaagagg attaccagtg 360

tccccaactg ttgttgccaa tccaagccct gcaaataaaa caaaatcgga agcctcatat 420

acaggtgttc agaggaaaat agtaagtggt gatgaaactt ggagaacttg tgaagtgaaa 480

tatgggagct actgcctctg gagggaggaa aataaggaac caatgaaaga tgccaaggtg 540

aagcaaatga aggaccagct gtttgtggct agagcatact atcccagtat tgctaaaatg 600

ccttctcaaa gcaagttgac tcgggatatg aaacagaata tccaagagtt tgagcgtatt 660

cttagtgaaa gttctcaaga tgctgacctt ccaccacagg ttgataaaaa gttgcagaag 720

atggaagctg taattgcaaa ggcaaagtct tttccagtcg actgtaacaa tgttgacaag 780

aaattgagac agatccttga tttgactgag gatgaagcta gtttccacat gaaacagagt 840

gtgttcctct accagcttgc agtacagaca atgcctaaga gtcttcattg cttgtcaatg 900

cgactaactg tggaacattt caagtcagat tcacttgagg atcccattag tgagaaattt 960

tcagatccct cattacttca ctttgttatc atctccgata atatactagc atcgtccgtt 1020

gtgatcaact caacggttgt acatgcaagg gacagtaaaa actttgtttt ccatgtactg 1080 

acagacgagc agaattactt tgcaatgaaa caatggttta ttaggaatcc ttgcaaacaa 1140 

tcaactgttc aagtattgaa cattgaaaaa ctcgagctgg acgattctga tatgaaactg 1200 

tctttgtctg cggagttccg tgtttccttc cccagtggtg accttttggc gtctcaacag 1260 

aatagaacac actacttatc ccttttctct caatctcact atcttcttcc caaattattt 1320 

gacaaattgg agaaggttgt gattctggat gatgacgttg tagtccagcg agacttatct 1380 

cccctttggg accttgatat ggaagggaaa gtgaatggcg ctgttaagtc gtgcactgtg 1440 

agattgggtc agctaaggag tctcaagaga ggaaattttg ataccaatgc ttgtctctgg 1500 

atgtctggtt tgaatgtcgt tgatcttgct agatggaggg cattgggtgt ttcagaaacc 1560 

tatcaaaaat attataaaga gatgagtagt ggagatgagt cgagcgaagc aattgcattg 1620 

caggcaagct tgctcacatt tcaagaccaa gtatatgctc ttgacgacaa atgggctcta 1680 

tcagggcttg gttatgacta ctacatcaat gcacaagcca taaaaaacgc agccatattg 1740 

cactataacg ggaacatgaa gccgtggctt gagctgggaa tcccaaatta caaaaactat 1800 

tggagaaggc atctgagtcg ggaagatcgg ttcttgagtg actgtaacgt gaatccttga 1860 

<210> 4 
<211> 619 
<212> PRT 

<213> Arabidopsis thaliana
<400> 4 

Met Lys Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Lys Arg
1 5 10 15

Arg Trp Lys Val Leu Val Ile Gly Val Leu Val Leu Val Ile Leu Ser
20 25 30

Met Leu Val Pro Leu Ala Phe Leu Leu Gly Leu His Asn Gly Phe His
35 40 45

Ser Pro Gly Phe Val Thr Val Gln Pro Ala Ser Ser Phe Glu Ser Phe
50 55 60

Thr Arg Ile Asn Ala Thr Lys His Thr Gln Arg Asp Val Ser Glu Arg
65 70 75 80

Val Asp Glu Val Leu Gln Lys Ile Asn Pro Val Leu Pro Lys Lys Ser
85 90 95

Asp Ile Asn Val Gly Ser Arg Asp Val Asn Ala Thr Ser Gly Thr Asp
100 105 110

Ser Lys Lys Arg Gly Leu Pro Val Ser Pro Thr Val Val Ala Asn Pro
115 120 125

Ser Pro Ala Asn Lys Thr Lys Ser Glu Ala Ser Tyr Thr Gly Val Gln
130 135 140

Arg Lys Ile Val Ser Gly Asp Glu Thr Trp Arg Thr Cys Glu Val Lys
145 150 155 160

Tyr Gly Ser Tyr Cys Leu Trp Arg Glu Glu Asn Lys Glu Pro Met Lys
165 170 175

Asp Ala Lys Val Lys Gln Met Lys Asp Gln Leu Phe Val Ala Arg Ala
180 185 190

Tyr Tyr Pro Ser Ile Ala Lys Met Pro Ser Gln Ser Lys Leu Thr Arg
195 200 205

Asp Met Lys Gln Asn Ile Gln Glu Phe Glu Arg Ile Leu Ser Glu Ser
210 215 220

Ser Gln Asp Ala Asp Leu Pro Pro Gln Val Asp Lys Lys Leu Gln Lys
225 230 235 240

Met Glu Ala Val Ile Ala Lys Ala Lys Ser Phe Pro Val Asp Cys Asn
245 250 255

Asn Val Asp Lys Lys Leu Arg Gln Ile Leu Asp Leu Thr Glu Asp Glu
260 265 270

Ala Ser Phe His Met Lys Gln Ser Val Phe Leu Tyr Gln Leu Ala Val
275 280 285

Gln Thr Met Pro Lys Ser Leu His Cys Leu Ser Met Arg Leu Thr Val
290 295 300

Glu His Phe Lys Ser Asp Ser Leu Glu Asp Pro Ile Ser Glu Lys Phe
305 310 315 320

Ser Asp Pro Ser Leu Leu His Phe Val Ile Ile Ser Asp Asn Ile Leu
325 330 335

Ala Ser Ser Val Val Ile Asn Ser Thr Val Val His Ala Arg Asp Ser
340 345 350

Lys Asn Phe Val Phe His Val Leu Thr Asp Glu Gln Asn Tyr Phe Ala
355 360 365

Met Lys Gln Trp Phe Ile Arg Asn Pro Cys Lys Gln Ser Thr Val Gln
370 375 380

Val Leu Asn Ile Glu Lys Leu Glu Leu Asp Asp Ser Asp Met Lys Leu
385 390 395 400

Ser Leu Ser Ala Glu Phe Arg Val Ser Phe Pro Ser Gly Asp Leu Leu
405 410 415

Ala Ser Gln Gln Asn Arg Thr His Tyr Leu Ser Leu Phe Ser Gln Ser
420 425 430

His Tyr Leu Leu Pro Lys Leu Phe Asp Lys Leu Glu Lys Val Val Ile
435 440 445

Leu Asp Asp Asp Val Val Val Gln Arg Asp Leu Ser Pro Leu Trp Asp
450 455 460

Leu Asp Met Glu Gly Lys Val Asn Gly Ala Val Lys Ser Cys Thr Val
465 470 475 480

Arg Leu Gly Gln Leu Arg Ser Leu Lys Arg Gly Asn Phe Asp Thr Asn
485 490 495

Ala Cys Leu Trp Met Ser Gly Leu Asn Val Val Asp Leu Ala Arg Trp
500 505 510

Arg Ala Leu Gly Val Ser Glu Thr Tyr Gln Lys Tyr Tyr Lys Glu Met
515 520 525

Ser Ser Gly Asp Glu Ser Ser Glu Ala Ile Ala Leu Gln Ala Ser Leu
530 535 540

Leu Thr Phe Gln Asp Gln Val Tyr Ala Leu Asp Asp Lys Trp Ala Leu
545 550 555 560

Ser Gly Leu Gly Tyr Asp Tyr Tyr Ile Asn Ala Gln Ala Ile Lys Asn
565 570 575

Ala Ala Ile Leu His Tyr Asn Gly Asn Met Lys Pro Trp Leu Glu Leu
580 585 590

Gly Ile Pro Asn Tyr Lys Asn Tyr Trp Arg Arg His Leu Ser Arg Glu
595 600 605

Asp Arg Phe Leu Ser Asp Cys Asn Val Asn Pro
610 615

<210> 5 
<211> 1851 
<212> DNA 

<213> Arabidopsis thaliana
<400> 5 

atgatggtga agcttcgcaa tcttgttctt ttcttcatgc tcctcaccgt cgttgctcat 60 

atccttctct acaccgatcc cgctgcctcc ttcaagaccc ccttttctaa acgcgatttc 120 

ctcgaggacg taaccgcctt gactttcaat tccgatgaga atcgtttgaa tcttcttcct 180 

cgggaatctc ccgctgtgct cagaggagga ctcgtcggtg ctgtctattc cgataagaat 240 

tcacggcggc tagaccaatt gtctgctcga gttctttccg ccaccgacga tgatactcac 300 

tcacatactg acatttccat caaacaagtc actcatgatg cagcctcaga ctcgcatatt 360 

aatagggaaa atatgcatgt tcaattgacc caacaaacct ctgaaaaagt tgatgagcaa 420 

ccagagccta atgcttttgg agctaagaaa gatactggaa acgtgttgat gcctgatgct 480 

caagtgaggc atcttaaaga tcagcttatt agggcaaagg tttatctttc ccttccatct 540 

gcaaaggcca atgctcattt tgtgagagag cttcgactcc gtattaaaga agttcaacgg 600

gcacttgcag atgcctccaa ggattcggat ctgccaaaga ctgctataga aaagctaaaa 660

gcaatggagc aaacactggc caaaggcaag cagatccaag atgactgttc tacagtggtc 720 

aagaagctac gtgctatgct ccactccgca gatgagcagc tacgggtcca taagaagcaa 780 

accatgtttt tgactcaatt gactgctaag accattccta aaggacttca ctgccttcct 840 

ctgcgcctca ctacagacta ttatgcttta aattcatctg aacaacaatt tccaaatcag 900 

gagaaactag aagatactca gctgtatcac tatgcccttt tctctgataa tgttttggct 960 

acgtcagttg ttgttaactc taccataacc aatgcaaagc atcccttaaa gcatgtcttc 1020 

cacatcgtca cagacagact caattatgcg gcaatgagga tgtggttcct ggacaatcca 1080 

cctggcaaag ccaccatcca ggttcagaat gttgaagaat ttacatggct gaattcaagc 1140 

tacagtcccg ttctcaaaca gcttagttct agatcgatga tagattatta cttcagagcc 1200 

caccatacaa attcagacac caacttgaag ttccggaatc caaaatactt atcgatcctt 1260 

aatcatcttc gtttttactt gcctgagatc tttcccaagc tcagcaaagt gctcttcttg 1320 

gatgatgata tagttgtgca gaaggacctt tctggtcttt ggtcagttga tctgaaaggt 1380 

aatgttaacg gtgctgtaga gacgtgtggg gaaagctttc atcgctttga ccgttatctg 1440 

aacttctcaa atccactcat ttccaagaac tttgaccctc gagcttgtgg ttgggcgtat 1500 

ggtatgaatg tctttgatct ggatgaatgg aagaggcaaa acatcacaga agtttatcat 1560

cgatggcagg atctgaatca agaccgagaa ttgtggaagc tagggacgtt gccgcctggt 1620

ctaatcacat tttggagacg aacatatccg ctagaccgga aatggcacat actagggctt 1680 

ggatacaacc cgagtgtgaa ccaaagggat attgagaggg cagccgtgat acactataat 1740 

ggcaacctca aaccatggct agagattggg attccaagat acagaggctt ctggtcaaag 1800 

catgtagact atgagcacgt ttatctcaga gaatgcaaca tcaatcctta g 1851 

<210> 6 

<211> 616 

<212> PRT 

<213> Arabidopsis thaliana

<400> 6 

Met Met Val Lys Leu Arg Asn Leu Val Leu Phe Phe Met Leu Leu Thr
1 5 10 15

Val Val Ala His Ile Leu Leu Tyr Thr Asp Pro Ala Ala Ser Phe Lys
20 25 30

Thr Pro Phe Ser Lys Arg Asp Phe Leu Glu Asp Val Thr Ala Leu Thr
35 40 45

Phe Asn Ser Asp Glu Asn Arg Leu Asn Leu Leu Pro Arg Glu Ser Pro
50 55 60

Ala Val Leu Arg Gly Gly Leu Val Gly Ala Val Tyr Ser Asp Lys Asn
65 70 75 80

Ser Arg Arg Leu Asp Gln Leu Ser Ala Arg Val Leu Ser Ala Thr Asp
85 90 95

Asp Asp Thr His Ser His Thr Asp Ile Ser Ile Lys Gln Val Thr His
100 105 110

Asp Ala Ala Ser Asp Ser His Ile Asn Arg Glu Asn Met His Val Gln
115 120 125

Leu Thr Gln Gln Thr Ser Glu Lys Val Asp Glu Gln Pro Glu Pro Asn
130 135 140

Ala Phe Gly Ala Lys Lys Asp Thr Gly Asn Val Leu Met Pro Asp Ala
145 150 155 160

Gln Val Arg His Leu Lys Asp Gln Leu Ile Arg Ala Lys Val Tyr Leu
165 170 175

Ser Leu Pro Ser Ala Lys Ala Asn Ala His Phe Val Arg Glu Leu Arg
180 185 190

Leu Arg Ile Lys Glu Val Gln Arg Ala Leu Ala Asp Ala Ser Lys Asp
195 200 205

Ser Asp Leu Pro Lys Thr Ala Ile Glu Lys Leu Lys Ala Met Glu Gln
210 215 220

Thr Leu Ala Lys Gly Lys Gln Ile Gln Asp Asp Cys Ser Thr Val Val
225 230 235 240

Lys Lys Leu Arg Ala Met Leu His Ser Ala Asp Glu Gln Leu Arg Val
245 250 255

His Lys Lys Gln Thr Met Phe Leu Thr Gln Leu Thr Ala Lys Thr Ile
260 265 270

Pro Lys Gly Leu His Cys Leu Pro Leu Arg Leu Thr Thr Asp Tyr Tyr
275 280 285

Ala Leu Asn Ser Ser Glu Gln Gln Phe Pro Asn Gln Glu Lys Leu Glu
290 295 300

Asp Thr Gln Leu Tyr His Tyr Ala Leu Phe Ser Asp Asn Val Leu Ala
305 310 315 320

Thr Ser Val Val Val Asn Ser Thr Ile Thr Asn Ala Lys His Pro Leu
325 330 335

Lys His Val Phe His Ile Val Thr Asp Arg Leu Asn Tyr Ala Ala Met
340 345 350

Arg Met Trp Phe Leu Asp Asn Pro Pro Gly Lys Ala Thr Ile Gln Val
355 360 365

Gln Asn Val Glu Glu Phe Thr Trp Leu Asn Ser Ser Tyr Ser Pro Val
370 375 380

Leu Lys Gln Leu Ser Ser Arg Ser Met Ile Asp Tyr Tyr Phe Arg Ala
385 390 395 400

His His Thr Asn Ser Asp Thr Asn Leu Lys Phe Arg Asn Pro Lys Tyr
405 410 415

Leu Ser Ile Leu Asn His Leu Arg Phe Tyr Leu Pro Glu Ile Phe Pro
420 425 430

Lys Leu Ser Lys Val Leu Phe Leu Asp Asp Asp Ile Val Val Gln Lys
435 440 445

Asp Leu Ser Gly Leu Trp Ser Val Asp Leu Lys Gly Asn Val Asn Gly
450 455 460

Ala Val Glu Thr Cys Gly Glu Ser Phe His Arg Phe Asp Arg Tyr Leu
465 470 475 480

Asn Phe Ser Asn Pro Leu Ile Ser Lys Asn Phe Asp Pro Arg Ala Cys
485 490 495

Gly Trp Ala Tyr Gly Met Asn Val Phe Asp Leu Asp Glu Trp Lys Arg
500 505 510

Gln Asn Ile Thr Glu Val Tyr His Arg Trp Gln Asp Leu Asn Gln Asp
515 520 525

Arg Glu Leu Trp Lys Leu Gly Thr Leu Pro Pro Gly Leu Ile Thr Phe
530 535 540

Trp Arg Arg Thr Tyr Pro Leu Asp Arg Lys Trp His Ile Leu Gly Leu
545 550 555 560

Gly Tyr Asn Pro Ser Val Asn Gln Arg Asp Ile Glu Arg Ala Ala Val
565 570 575

Ile His Tyr Asn Gly Asn Leu Lys Pro Trp Leu Glu Ile Gly Ile Pro
580 585 590

Arg Tyr Arg Gly Phe Trp Ser Lys His Val Asp Tyr Glu His Val Tyr
595 600 605

Leu Arg Glu Cys Asn Ile Asn Pro
610 615 

<210> 7 
<211> 1770 
<212> DNA 

<213> Arabidopsis thaliana
<400> 7 

atgaaacaaa ttcgtcgatg gcagaggatt ttgatcctcg ctctgctatc gatatcagta 60 
ttcgctccgc ttattttcgt atcgaatcgg cttaagagca tcactcccgt tggtcgtaga 120 
gaatttattg aagagttatc caaaattaga ttcacgacaa atgaccttag acttagcgct 180 
attgaacatg aggatggaga aggcttgaag gggccaaggc tcattctctt caaggatggg 240
gagtttaatt cgtctgctga aagtgatggt ggtaatactt acaaaaacag ggaagaacaa 300

gtgattgttt cacagaagat gacagttagc tctgatgaaa agggtcaaat tctaccaaca 360 

gtcaaccaac ttgctaataa aacggatttc aagccccctt tatctaaggg tgaaaagaac 420 

acaagggttc agcccgacag agcaacagat gtgaaaacga aggagatcag agacaaaatt 480 

attcaagcta aagcctacct gaatttcgct ccacctggaa gtaactctca agttgtgaag 540 

gagttgagag gtcggctgaa agagctggaa cggtctgttg gtgatgcaac aaaggacaag 600 

gacttatcaa agggcgctct ccgcagggtg aagcccatgg aaaatgtgtt atataaggct 660 

agtcgtgtct ttaacaattg ccctgccatc gctaccaaac tccgtgccat gaattataac 720 

acagaagaac aagttcaggc gcagaaaaat caagcagcgt atctaatgca gcttgcagca 780 

aggaccaccc caaaagggct tcactgtctc tcaatgcggc tgacatcaga atacttttca 840 

ctggatcctg aaaaaaggca gatgcctaac cagcaaaatt attttgacgc taatttcaat 900

cattatgttg tcttctctga caatgttttg gcttcttcag tcgttgttaa ctctacgata 960

tcttcatcaa aggagccaga aagaatagtc ttccatgtcg tgactgattc acttaattac 1020 

ccagcaatct caatgtggtt tctgctaaac attcaaagta aagctactat ccaaatccta 1080 

aacattgatg atatggatgt cctgcctaga gattatgatc aattactgat gaagcaaaac 1140 

tctaatgacc caagattcat ttctacactc aatcacgcac gcttctatct cccggatata 1200 

ttcccgggtt tgaacaagat ggtactcttg gaccatgatg tagttgttca aagagattta 1260 

agtagactgt ggagcattga tatgaaagga aaggtggttg gagctgtaga gacttgtctt 1320 

gaaggtgaat cttcatttcg atcaatgagc acatttatta atttctcaga cacatgggtc 1380 

gctgggaaat ttagtcctag agcttgcaca tgggctttcg ggatgaatct aattgatctc 1440 

gaagaatgga gaatacggaa gttgacttct acatacataa aatacttcaa cctgggaaca 1500 

aagagaccat tgtggaaagc tgggagctta ccaataggtt ggttgacttt ctataggcaa 1560 

acattagcat tggacaagag atggcatgtg atggggttag gtcgcgaatc aggagtcaaa 1620 

gcggttgaca tcgaacaagc ggcagttata cactacgatg gggtcatgaa gccgtggttg 1680 

gacattggaa aagagaatta caaacgttac tggaacatac acgtccctta ccatcacacc 1740 

tacttgcaac agtgcaatct tcaagcttga 1770

<210> 8 
<211> 589 

<212> PRT

<213> Arabidopsis thaliana

<400> 8 

Met Lys Gln Ile Arg Arg Trp Gln Arg Ile Leu Ile Leu Ala Leu Leu
1 5 10 15

Page 13 



WO 2004/072250 PCT/US2004/003545 



14-03 WO. sequence listing.txt 

Ser Ile Ser Val Phe Ala Pro Leu Ile Phe Val Ser Asn Arg Leu Lys
20 25 30

Ser Ile Thr Pro Val Gly Arg Arg Glu Phe Ile Glu Glu Leu Ser Lys
35 40 45

Ile Arg Phe Thr Thr Asn Asp Leu Arg Leu Ser Ala Ile Glu His Glu
50 55 60

Asp Gly Glu Gly Leu Lys Gly Pro Arg Leu Ile Leu Phe Lys Asp Gly
65 70 75 80

Glu Phe Asn Ser Ser Ala Glu Ser Asp Gly Gly Asn Thr Tyr Lys Asn
85 90 95

Arg Glu Glu Gln Val Ile Val Ser Gln Lys Met Thr Val Ser Ser Asp
100 105 110

Glu Lys Gly Gln Ile Leu Pro Thr Val Asn Gln Leu Ala Asn Lys Thr
115 120 125

Asp Phe Lys Pro Pro Leu Ser Lys Gly Glu Lys Asn Thr Arg Val Gln
130 135 140

Pro Asp Arg Ala Thr Asp Val Lys Thr Lys Glu Ile Arg Asp Lys Ile
145 150 155 160

Ile Gln Ala Lys Ala Tyr Leu Asn Phe Ala Pro Pro Gly Ser Asn Ser
165 170 175

Gln Val Val Lys Glu Leu Arg Gly Arg Leu Lys Glu Leu Glu Arg Ser
180 185 190

Val Gly Asp Ala Thr Lys Asp Lys Asp Leu Ser Lys Gly Ala Leu Arg
195 200 205

Arg Val Lys Pro Met Glu Asn Val Leu Tyr Lys Ala Ser Arg Val Phe
210 215 220

Asn Asn Cys Pro Ala Ile Ala Thr Lys Leu Arg Ala Met Asn Tyr Asn
225 230 235 240

Thr Glu Glu Gln Val Gln Ala Gln Lys Asn Gln Ala Ala Tyr Leu Met
245 250 255

Gln Leu Ala Ala Arg Thr Thr Pro Lys Gly Leu His Cys Leu Ser Met
260 265 270

Arg Leu Thr Ser Glu Tyr Phe Ser Leu Asp Pro Glu Lys Arg Gln Met
275 280 285

Pro Asn Gln Gln Asn Tyr Phe Asp Ala Asn Phe Asn His Tyr Val Val
290 295 300

Phe Ser Asp Asn Val Leu Ala Ser Ser Val Val Val Asn Ser Thr Ile
305 310 315 320

Ser Ser Ser Lys Glu Pro Glu Arg Ile Val Phe His Val Val Thr Asp
325 330 335

Ser Leu Asn Tyr Pro Ala Ile Ser Met Trp Phe Leu Leu Asn Ile Gln
340 345 350

Ser Lys Ala Thr Ile Gln Ile Leu Asn Ile Asp Asp Met Asp Val Leu
355 360 365

Pro Arg Asp Tyr Asp Gln Leu Leu Met Lys Gln Asn Ser Asn Asp Pro
370 375 380

Arg Phe Ile Ser Thr Leu Asn His Ala Arg Phe Tyr Leu Pro Asp Ile
385 390 395 400

Phe Pro Gly Leu Asn Lys Met Val Leu Leu Asp His Asp Val Val Val
405 410 415

Gln Arg Asp Leu Ser Arg Leu Trp Ser Ile Asp Met Lys Gly Lys Val
420 425 430

Val Gly Ala Val Glu Thr Cys Leu Glu Gly Glu Ser Ser Phe Arg Ser
435 440 445

Met Ser Thr Phe Ile Asn Phe Ser Asp Thr Trp Val Ala Gly Lys Phe
450 455 460

Ser Pro Arg Ala Cys Thr Trp Ala Phe Gly Met Asn Leu Ile Asp Leu
465 470 475 480

Glu Glu Trp Arg Ile Arg Lys Leu Thr Ser Thr Tyr Ile Lys Tyr Phe
485 490 495

Asn Leu Gly Thr Lys Arg Pro Leu Trp Lys Ala Gly Ser Leu Pro Ile
500 505 510

Gly Trp Leu Thr Phe Tyr Arg Gln Thr Leu Ala Leu Asp Lys Arg Trp
515 520 525

His Val Met Gly Leu Gly Arg Glu Ser Gly Val Lys Ala Val Asp Ile
530 535 540

Glu Gln Ala Ala Val Ile His Tyr Asp Gly Val Met Lys Pro Trp Leu
545 550 555 560

Asp Ile Gly Lys Glu Asn Tyr Lys Arg Tyr Trp Asn Ile His Val Pro
565 570 575

Tyr His His Thr Tyr Leu Gln Gln Cys Asn Leu Gln Ala
580 585 

<210> 9 
<211> 1614 
<212> DNA 

<213> Arabidopsis thaliana
<400> 9 

atgaggcggt ggccggtgga tcaccggcgg cgaggtagaa ggagattgtc gagttggata 60 

tggtttctcc ttggttcttt ctctgtcgct ggtttagttc tcttcatcgt tcagcattat 120 

caccatcaac aagatccatc ccagctttta cttgagagag acacgagaac cgaaatggta 180 

tctcctcccc atttaaactt cacggaagag gtcacaagtg cttcctcctt ctctaggcag 240 

ttagcagagc aaatgacact tgccaaagct tatgtgttta tagctaaaga gcataataat 300 

cttcatttag cttgggaatt gagttctaag atcagaagtt gtcagctttt gctttccaaa 360 

gcagctatga gaggacaacc tatttcgttt gatgaggcta aaccgattat tactggtcta 420 

tcagctctta tctacaaggc tcaagatgca cattatgata ttgccaccac tatgatgacc 480 

atgaaatctc acatccaagc acttgaagag cgtgcaaatg cagctactgt tcagaccaca 540 

atatttgggc aattggttgc tgaggcatta ccaaagagcc tccactgttt gacgataaag 600 

ctcacatctg attgggtaac agagccatct cgccatgaac tggcagatga gaacagaaac 660 

tcacctagac ttgtcgacaa caacctctac cacttctgca tcttctcgga caacgtgatt 720 

gccacctcgg ttgttgttaa ttcaactgtc tcgaatgctg atcatccaaa gcagcttgtt 780 

ttccacatag tgacgaatcg agtgagctac aaagctatgc aggcctggtt tctaagtaat 840 

gacttcaagg gctcagcaat agagatcagg agcgtagagg agttttcttg gttgaatgct 900 

tcatattctc ctgttgttaa gcaactgctg gacacagatg caagagctta ctatttcggg 960 

gaacagacaa gtcaagatac gatttccgag ccaaaagtga ggaacccaaa gtacttgtca 1020 

ttactgaacc atctcagatt ctacattccg gagatctatc cacagctaga gaagattgtt 1080 

ttcctagacg atgatgttgt tgttcagaaa gatttgactc cactcttctc cttggatctg 1140

catggaaacg tcaatggagc tgtggaaaca tgtcttgaag cctttcaccg atattacaag 1200

tatctaaatt tctcgaaccc actcatcagc tcaaagttcg acccacaagc atgtggatgg 1260 

gcttttggta tgaacgtttt tgatctgatc gcttggagga atgcaaacgt gactgctcgg 1320 

taccattact ggcaagatca gaacagagaa cgaacgcttt ggaaactcgg gacactccct 1380 

ccaggtctac tatctttcta tggtctcaca gagccactgg acagaagatg gcatgtcttg 1440 

ggtttaggtt acgatgtgaa catcgataac cgtctgatcg aaacagcagc tgtgattcac 1500 

tataatggta acatgaagcc ttggctaaag ctggctattg gtaggtataa acctttctgg 1560 

ttaaagtttt tgaactcgag ccatccttat ttacaagatt gtgtcacagc ttaa 1614 

<210> 10 
<211> 537 
<212> PRT 

<213> Arabidopsis thaliana
<400> 10 

Met Arg Arg Trp Pro Val Asp His Arg Arg Arg Gly Arg Arg Arg Leu
1 5 10 15

Ser Ser Trp Ile Trp Phe Leu Leu Gly Ser Phe Ser Val Ala Gly Leu
20 25 30

Val Leu Phe Ile Val Gln His Tyr His His Gln Gln Asp Pro Ser Gln
35 40 45

Leu Leu Leu Glu Arg Asp Thr Arg Thr Glu Met Val Ser Pro Pro His
50 55 60

Leu Asn Phe Thr Glu Glu Val Thr Ser Ala Ser Ser Phe Ser Arg Gln
65 70 75 80

Leu Ala Glu Gln Met Thr Leu Ala Lys Ala Tyr Val Phe Ile Ala Lys
85 90 95

Glu His Asn Asn Leu His Leu Ala Trp Glu Leu Ser Ser Lys Ile Arg
100 105 110

Ser Cys Gln Leu Leu Leu Ser Lys Ala Ala Met Arg Gly Gln Pro Ile
115 120 125

Ser Phe Asp Glu Ala Lys Pro Ile Ile Thr Gly Leu Ser Ala Leu Ile
130 135 140

Tyr Lys Ala Gln Asp Ala His Tyr Asp Ile Ala Thr Thr Met Met Thr
145 150 155 160 


Met Lys Ser His Ile Gln Ala Leu Glu Glu Arg Ala Asn Ala Ala Thr
165 170 175

Val Gln Thr Thr Ile Phe Gly Gln Leu Val Ala Glu Ala Leu Pro Lys
180 185 190

Ser Leu His Cys Leu Thr Ile Lys Leu Thr Ser Asp Trp Val Thr Glu
195 200 205

Pro Ser Arg His Glu Leu Ala Asp Glu Asn Arg Asn Ser Pro Arg Leu
210 215 220

Val Asp Asn Asn Leu Tyr His Phe Cys Ile Phe Ser Asp Asn Val Ile
225 230 235 240

Ala Thr Ser Val Val Val Asn Ser Thr Val Ser Asn Ala Asp His Pro
245 250 255

Lys Gln Leu Val Phe His Ile Val Thr Asn Arg Val Ser Tyr Lys Ala
260 265 270

Met Gln Ala Trp Phe Leu Ser Asn Asp Phe Lys Gly Ser Ala Ile Glu
275 280 285

Ile Arg Ser Val Glu Glu Phe Ser Trp Leu Asn Ala Ser Tyr Ser Pro
290 295 300

Val Val Lys Gln Leu Leu Asp Thr Asp Ala Arg Ala Tyr Tyr Phe Gly
305 310 315 320

Glu Gln Thr Ser Gln Asp Thr Ile Ser Glu Pro Lys Val Arg Asn Pro
325 330 335

Lys Tyr Leu Ser Leu Leu Asn His Leu Arg Phe Tyr Ile Pro Glu Ile
340 345 350

Tyr Pro Gln Leu Glu Lys Ile Val Phe Leu Asp Asp Asp Val Val Val
355 360 365

Gln Lys Asp Leu Thr Pro Leu Phe Ser Leu Asp Leu His Gly Asn Val
370 375 380

Asn Gly Ala Val Glu Thr Cys Leu Glu Ala Phe His Arg Tyr Tyr Lys
385 390 395 400

Tyr Leu Asn Phe Ser Asn Pro Leu Ile Ser Ser Lys Phe Asp Pro Gln
405 410 415

Ala Cys Gly Trp Ala Phe Gly Met Asn Val Phe Asp Leu Ile Ala Trp
420 425 430

Arg Asn Ala Asn Val Thr Ala Arg Tyr His Tyr Trp Gln Asp Gln Asn
435 440 445

Arg Glu Arg Thr Leu Trp Lys Leu Gly Thr Leu Pro Pro Gly Leu Leu
450 455 460

Ser Phe Tyr Gly Leu Thr Glu Pro Leu Asp Arg Arg Trp His Val Leu
465 470 475 480

Gly Leu Gly Tyr Asp Val Asn Ile Asp Asn Arg Leu Ile Glu Thr Ala
485 490 495

Ala Val Ile His Tyr Asn Gly Asn Met Lys Pro Trp Leu Lys Leu Ala
500 505 510

Ile Gly Arg Tyr Lys Pro Phe Trp Leu Lys Phe Leu Asn Ser Ser His
515 520 525

Pro Tyr Leu Gln Asp Cys Val Thr Ala
530 535 

<210> 11 

<211> 1611 

<212> DNA 

<213> Arabidopsis thaliana



<400> 11

atgagaagga gaggagggga tagtttccgg agagctggac ggaggaagat ctcgaatgtg 60

gtatggtggg ttctctctgg tattgccctc ctgctcttct ttctcattct ctccaaagct 120

ggtcatattg aacctagacc ctctattcct aagcgacgtt accgtaatga caaatttgta 180

gagggtatga atatgactga ggaaatgttg agtcctactt ccgttgctcg tcaagttaat 240

gatcagattg ctcttgctaa agcttttgtt gtcattgcta aagaaagtaa gaatcttcag 300

tttgcttggg acttaagtgc tcagatccgt aactctcagt tgcttttatc gagtgctgct 360

actaggagaa gtcccttgac tgtcttggaa tctgagtcta ctattcgtga catggctgtt 420

ttgttatatc aagctcagca gcttcactat gatagtgcta ctatgattat gaggcttaag 480

gcctcgattc aggctcttga agaacaaatg agttccgtta gcgagaagag ttccaagtat 540

ggacagattg ctgctgagga agtgcctaag agtctttact gtcttggtgt tcgtctcact 600

accgaatggt ttcagaattt agacttacag agaactctta aggaaaggag tcgtgttgat 660

tcgaaactca cggataacag tctctaccat ttctgtgtgt tttccgataa

acttctgttg tggttaattc tactgctctc aattccaagg cccctgagaa agttgtgttt 780

catcttgtga ctaatgagat caactatgct gcaatgaagg cttggttcgc cattaatatg 840 

gacaacctca gaggagtcac tgtggaggtt cagaagttcg aggatttctc atggctgaat 900 

gcttcctatg ttccggtcct caagcagctg caagactctg atacgcaaag ctattatttc 960 

tctggacaca acgatgatgg gcgcactcca atcaaattca ggaaccccaa gtatctttcc 1020 

atgctcaacc atcttaggtt ctacatccct gaagtgtttc ctgcgctgaa gaaggtggtc 1080 

tttcttgatg atgatgttgt agttcagaag gatctttcat ctctcttttc gatcgattta 1140 

aacaaaaatg tgaacggggc tgttgagacc tgcatggaga ccttccaccg ctaccacaag 1200 

tacttgaact attctcatcc tctcatacgc tcccactttg atccagatgc gtgtgggtgg 1260 

gcgtttggaa tgaacgtctt tgatttagtt gagtggagga agagaaatgt gaccggcata 1320 

taccactact ggcaagaaaa aaacgtggac cggaccttat ggaaactggg aacactacct 1380 

ccaggacttc tgacatttta cgggttaaca gaggcactag aggcgtcctg gcatatcctg 1440 

ggattgggat acacgaatgt ggatgctcgt gtgatagaga aaggagctgt tcttcacttc 1500 

aatgggaact taaagccatg gttgaagatc gggatagaga agtacaaacc tttgtgggag 1560 

agatacgttg attacacttc tccttttatg caacaatgca attttcattg a 1611 

<210> 12 

<211> 536 

<212> PRT 

<213> Arabidopsis thaliana

<400> 12 

Met Arg Arg Arg Gly Gly Asp Ser Phe Arg Arg Ala Gly Arg Arg Lys
1 5 10 15

Ile Ser Asn Val Val Trp Trp Val Leu Ser Gly Ile Ala Leu Leu Leu
20 25 30

Phe Phe Leu Ile Leu Ser Lys Ala Gly His Ile Glu Pro Arg Pro Ser
35 40 45

Ile Pro Lys Arg Arg Tyr Arg Asn Asp Lys Phe Val Glu Gly Met Asn
50 55 60

Met Thr Glu Glu Met Leu Ser Pro Thr Ser Val Ala Arg Gln Val Asn
65 70 75 80

Asp Gln Ile Ala Leu Ala Lys Ala Phe Val Val Ile Ala Lys Glu Ser
85 90 95 



Lys Asn Leu Gln Phe Ala Trp Asp Leu Ser Ala Gln Ile Arg Asn Ser
100 105 110

Gln Leu Leu Leu Ser Ser Ala Ala Thr Arg Arg Ser Pro Leu Thr Val
115 120 125

Leu Glu Ser Glu Ser Thr Ile Arg Asp Met Ala Val Leu Leu Tyr Gln
130 135 140

Ala Gln Gln Leu His Tyr Asp Ser Ala Thr Met Ile Met Arg Leu Lys
145 150 155 160

Ala Ser Ile Gln Ala Leu Glu Glu Gln Met Ser Ser Val Ser Glu Lys
165 170 175

Ser Ser Lys Tyr Gly Gln Ile Ala Ala Glu Glu Val Pro Lys Ser Leu
180 185 190

Tyr Cys Leu Gly Val Arg Leu Thr Thr Glu Trp Phe Gln Asn Leu Asp
195 200 205

Leu Gln Arg Thr Leu Lys Glu Arg Ser Arg Val Asp Ser Lys Leu Thr
210 215 220

Asp Asn Ser Leu Tyr His Phe Cys Val Phe Ser Asp Asn Ile Ile Ala
225 230 235 240

Thr Ser Val Val Val Asn Ser Thr Ala Leu Asn Ser Lys Ala Pro Glu
245 250 255

Lys Val Val Phe His Leu Val Thr Asn Glu Ile Asn Tyr Ala Ala Met
260 265 270

Lys Ala Trp Phe Ala Ile Asn Met Asp Asn Leu Arg Gly Val Thr Val
275 280 285

Glu Val Gln Lys Phe Glu Asp Phe Ser Trp Leu Asn Ala Ser Tyr Val
290 295 300

Pro Val Leu Lys Gln Leu Gln Asp Ser Asp Thr Gln Ser Tyr Tyr Phe
305 310 315 320

Ser Gly His Asn Asp Asp Gly Arg Thr Pro Ile Lys Phe Arg Asn Pro
325 330 335

Lys Tyr Leu Ser Met Leu Asn His Leu Arg Phe Tyr Ile Pro Glu Val
340 345 350

Phe Pro Ala Leu Lys Lys Val Val Phe Leu Asp Asp Asp Val Val Val
355 360 365 

Gln Lys Asp Leu Ser Ser Leu Phe Ser Ile Asp Leu Asn Lys Asn Val
370 375 380

Asn Gly Ala Val Glu Thr Cys Met Glu Thr Phe His Arg Tyr His Lys
385 390 395 400

Tyr Leu Asn Tyr Ser His Pro Leu Ile Arg Ser His Phe Asp Pro Asp
405 410 415

Ala Cys Gly Trp Ala Phe Gly Met Asn Val Phe Asp Leu Val Glu Trp
420 425 430

Arg Lys Arg Asn Val Thr Gly Ile Tyr His Tyr Trp Gln Glu Lys Asn
435 440 445

Val Asp Arg Thr Leu Trp Lys Leu Gly Thr Leu Pro Pro Gly Leu Leu
450 455 460

Thr Phe Tyr Gly Leu Thr Glu Ala Leu Glu Ala Ser Trp His Ile Leu
465 470 475 480

Gly Leu Gly Tyr Thr Asn Val Asp Ala Arg Val Ile Glu Lys Gly Ala
485 490 495

Val Leu His Phe Asn Gly Asn Leu Lys Pro Trp Leu Lys Ile Gly Ile
500 505 510

Glu Lys Tyr Lys Pro Leu Trp Glu Arg Tyr Val Asp Tyr Thr Ser Pro
515 520 525

Phe Met Gln Gln Cys Asn Phe His
530 535 

<210> 13 
<211> 1833 
<212> DNA 

<213> Arabidopsis thaliana
<400> 13 

atgaatcaag ttcgtcgttg gcagaggatt ctgatcctct cgctgctatt gttatctgtt 60 
ttagctccga ttgttttcgt ttcgaatcgg ctcaagagca tcacttccgt cgatagagga 120 
gaattcattg aagaattatc cgacattaca gataagaccg aggatgaact tagacttact 180 
gctattgaac aggacgaaga aggcttgaag gagcctaaac gtattctgca ggatcgagat 240
tttaattctg tggttttgtc aaattcctct gataaaagta atgatactgt gcagtctaat 300

gagggagacc aaaaaaactt tctctcagaa gttgataagg gaaataatca caaaccaaag 360 

gaggaacaag cagtttcaca gaaaaccaca gtaagctcga atgcggaggt gaaaatttca 420 

gcaagagata ttcaacttaa tcataaaacg gaattccgac ccccttcaag taagagtgaa 480 

aagaatacaa gggttcaact tgaaagagca acagatgaga gggtaaagga gatcagagac 540 

aaaattatcc aagcgaaagc ctatctgaat ttggccctac ctgggaataa ctcccaaatc 600 

gtaaaggagt tgagagttcg aacgaaagag ctggaacggg ctactggtga tactaccaag 660 

gataaatatt tgccaaagag ctctcctaac agattgaagg ccatggaagt tgcgttatac 720 

aaggtcagcc gtgcctttca caactgccct gccattgcta ccaaactcca agccatgact 780 

tataaaaccg aagaacaagc tcgggcgcag aagaaacaag cagcatattt aatgcagctt 840 

gcagcaagga ctaccccaaa agggcttcat tgtctctcaa tgcggttgac aacagaatat 900 

tttaccctgg atcacgaaaa aaggcagctt ttgcaacaaa gttataatga tcctgatctc 960 

taccattacg tagtcttctc tgacaatgtt ttggcctctt cggttgttgt taactctaca 1020 

atctcctcat caaaggaacc ggataaaata gtattccatg tggtgacaga ttcactcaat 1080 

tacccagcaa tctcaatgtg gtttttacta aacccaagtg gcagagcttc aatccaaatc 1140 

ctaaacattg atgaaatgaa tgtcctgcca ttgtaccatg ctgaattgct gatgaagcaa 1200 

aattcaagtg acccaagaat catttcagcg ctcaaccatg cacgcttcta tctcccagat 1260 

atcttcccag gtctaaacaa gatcgtactc ttcgatcatg atgtagtagt gcaaagggat 1320 

ctaactagac tgtggagcct tgatatgacg gggaaagttg ttggagctgt agagacttgt 1380 

cttgaaggtg atccttcata tcgttcgatg gactcattca ttaatttctc agatgcatgg 1440 

gtttctcaga aatttgatcc caaggcttgc acttgggcat tcgggatgaa tctatttgat 1500 

ctcgaagaat ggagaagaca ggagttgact tctgtatacc tgaaatactt cgacctggga 1560 

gtaaaaggac atctgtggaa agcaggggga ttgccagtag gttggttgac ttttttcggg 1620 

caaacgtttc cgttggaaaa gagatggaac gtgggtgggt taggtcacga atcaggactc 1680 

agggcaagcg acatcgaaca agcagcggtt atacactacg acgggatcat gaaaccatgg 1740 

ctggacatcg gtatagacaa gtacaagcgc tactggaaca tacatgtacc ttaccatcac 1800 

cctcacttac aacggtgcaa cattcacgat tga 1833 

<210> 14 
<211> 610 
<212> PRT 

<213> Arabidopsis thaliana
<400> 14 

Met Asn Gln Val Arg Arg Trp Gln Arg Ile Leu Ile Leu Ser Leu Leu
1 5 10 15

Leu Leu Ser Val Leu Ala Pro Ile Val Phe Val Ser Asn Arg Leu Lys
20 25 30

Ser Ile Thr Ser Val Asp Arg Gly Glu Phe Ile Glu Glu Leu Ser Asp
35 40 45

Ile Thr Asp Lys Thr Glu Asp Glu Leu Arg Leu Thr Ala Ile Glu Gln
50 55 60

Asp Glu Glu Gly Leu Lys Glu Pro Lys Arg Ile Leu Gln Asp Arg Asp
65 70 75 80

Phe Asn Ser Val Val Leu Ser Asn Ser Ser Asp Lys Ser Asn Asp Thr
85 90 95

Val Gln Ser Asn Glu Gly Asp Gln Lys Asn Phe Leu Ser Glu Val Asp
100 105 110

Lys Gly Asn Asn His Lys Pro Lys Glu Glu Gln Ala Val Ser Gln Lys
115 120 125

Thr Thr Val Ser Ser Asn Ala Glu Val Lys Ile Ser Ala Arg Asp Ile
130 135 140

Gln Leu Asn His Lys Thr Glu Phe Arg Pro Pro Ser Ser Lys Ser Glu
145 150 155 160

Lys Asn Thr Arg Val Gln Leu Glu Arg Ala Thr Asp Glu Arg Val Lys
165 170 175

Glu Ile Arg Asp Lys Ile Ile Gln Ala Lys Ala Tyr Leu Asn Leu Ala
180 185 190

Leu Pro Gly Asn Asn Ser Gln Ile Val Lys Glu Leu Arg Val Arg Thr
195 200 205



Lys Glu Leu Glu Arg Ala Thr Gly Asp Thr Thr Lys Asp Lys Tyr Leu
210 215 220

Pro Lys Ser Ser Pro Asn Arg Leu Lys Ala Met Glu Val Ala Leu Tyr
225 230 235 240

Lys Val Ser Arg Ala Phe His Asn Cys Pro Ala Ile Ala Thr Lys Leu
245 250 255

Gln Ala Met Thr Tyr Lys Thr Glu Glu Gln Ala Arg Ala Gln Lys Lys
260 265 270

Gln Ala Ala Tyr Leu Met Gln Leu Ala Ala Arg Thr Thr Pro Lys Gly
275 280 285

Leu His Cys Leu Ser Met Arg Leu Thr Thr Glu Tyr Phe Thr Leu Asp
290 295 300

His Glu Lys Arg Gln Leu Leu Gln Gln Ser Tyr Asn Asp Pro Asp Leu
305 310 315 320

Tyr His Tyr Val Val Phe Ser Asp Asn Val Leu Ala Ser Ser Val Val
325 330 335

Val Asn Ser Thr Ile Ser Ser Ser Lys Glu Pro Asp Lys Ile Val Phe
340 345 350

His Val Val Thr Asp Ser Leu Asn Tyr Pro Ala Ile Ser Met Trp Phe
355 360 365

Leu Leu Asn Pro Ser Gly Arg Ala Ser Ile Gln Ile Leu Asn Ile Asp
370 375 380

Glu Met Asn Val Leu Pro Leu Tyr His Ala Glu Leu Leu Met Lys Gln
385 390 395 400

Asn Ser Ser Asp Pro Arg Ile Ile Ser Ala Leu Asn His Ala Arg Phe
405 410 415

Tyr Leu Pro Asp Ile Phe Pro Gly Leu Asn Lys Ile Val Leu Phe Asp
420 425 430

His Asp Val Val Val Gln Arg Asp Leu Thr Arg Leu Trp Ser Leu Asp
435 440 445

Met Thr Gly Lys Val Val Gly Ala Val Glu Thr Cys Leu Glu Gly Asp
450 455 460

Pro Ser Tyr Arg Ser Met Asp Ser Phe Ile Asn Phe Ser Asp Ala Trp
465 470 475 480

Val Ser Gln Lys Phe Asp Pro Lys Ala Cys Thr Trp Ala Phe Gly Met
485 490 495

Asn Leu Phe Asp Leu Glu Glu Trp Arg Arg Gln Glu Leu Thr Ser Val
500 505 510

Tyr Leu Lys Tyr Phe Asp Leu Gly Val Lys Gly His Leu Trp Lys Ala
515 520 525

Gly Gly Leu Pro Val Gly Trp Leu Thr Phe Phe Gly Gln Thr Phe Pro
530 535 540

Leu Glu Lys Arg Trp Asn Val Gly Gly Leu Gly His Glu Ser Gly Leu
545 550 555 560

Arg Ala Ser Asp Ile Glu Gln Ala Ala Val Ile His Tyr Asp Gly Ile
565 570 575

Met Lys Pro Trp Leu Asp Ile Gly Ile Asp Lys Tyr Lys Arg Tyr Trp
580 585 590

Asn Ile His Val Pro Tyr His His Pro His Leu Gln Arg Cys Asn Ile
595 600 605 

His Asp 
610 

<210> 15 
<211> 1587 
<212> DNA 

<213> Arabidopsis thaliana
<400> 15 

atgactgatg cttgttgttt gaagggaaac gaggacaaaa tggttcctcg ttttggtcat 60 

ggaacctgga taggaaaagc atttaatgat acaccagaga tgttgcatga aaggagtctg 120 

agacaggaaa aaagattgga aagggctaat gagctgatga atgatgatag tctgcaaaag 180 

cttgagacgg cagccatggc acgttccaga tctgtcgatt ctgcaccact aggaaactac 240 

accatttgga aaaatgaata ccggaggggc aagagttttg aagatatgtt acgtttgatg 300 

caagatcaaa tcatcatggc acgagtttac agtggacttg caaagtttac aaacaatctc 360 

gccttgcacc aagagataga aacacaacta atgaaactag cttgggagga agaatctact 420 

gatattgatc aggagcagag agtacttgac agtataagag acatgggaca aatactggct 480 

agagcacacg agcagctata tgaatgcaag ttggtgacaa ataagttgag agcaatgcta 540 

caaacagttg aagatgaact cgaaaacgag cagacttata taacgttctt gactcagcta 600 

gcttccaagg cactaccaga tgctatccac tgcttgacca tgcgcttgaa tctagagtat 660 

catctcctgc ctttaccgat gagaaatttt ccaaggaggg agaatttgga gaatccaaaa 720 

ctttaccact acgctctctt ctctgataat gtactggctg catcagttgt tgtcaactcc 780 

acagtcatga atgcacagga tccttcaagg catgttttcc accttgtgac tgataagctc 840

aactttggag caatgagtat gtggtttctg ttgaaccctc ctggagaagc gaccatccat 900

gtccaaaggt ttgaagattt tacttggctc aactcatctt actctccagt tttgagtcag 960 

ctcgagtcag cagctatgaa gaagttctac ttcaagacag cgaggtctga atcagttgaa 1020 

tcaggctcag aaaacctcaa gtaccggtac ccgaaataca tgtcaatgct taaccacctg 1080 

aggttctaca tccctaggat cttcccaaag ttggagaaaa tcttgtttgt tgacgatgat 1140 

gtggttgttc agaaggattt aactccccta tggtccattg atcttaaagg gaaagtgaat 1200 

gaaaactttg atcccaagtt ctgcggatgg gcttatggga tgaacatctt cgacctgaaa 1260 

gaatggaaga agaacaacat tacagaaact tatcactttt ggcaaaacct gaacgaaaac 1320 

cggactctat ggaaactagg aacattgcca ccagggctca taacgttcta caatctgaca 1380 

caaccacttc agagaaaatg gcacttactt ggactgggtt atgataaagg aatcgatgtc 1440 

aagaagattg aaagatcagc tgttatacat tacaatggac acatgaaacc atggacagag 1500 

atggggataa gcaagtatca gccatattgg acgaagtaca ccaattttga ccatccttac 1560 

atctttactt gcaggctgtt tgagtga 1587 

<210> 16 
<211> 528 
<212> PRT 

<213> Arabidopsis thaliana
<400> 16 

Met Thr Asp Ala Cys Cys Leu Lys Gly Asn Glu Asp Lys Met Val Pro
1 5 10 15

Arg Phe Gly His Gly Thr Trp Ile Gly Lys Ala Phe Asn Asp Thr Pro
20 25 30

Glu Met Leu His Glu Arg Ser Leu Arg Gln Glu Lys Arg Leu Glu Arg
35 40 45

Ala Asn Glu Leu Met Asn Asp Asp Ser Leu Gln Lys Leu Glu Thr Ala
50 55 60

Ala Met Ala Arg Ser Arg Ser Val Asp Ser Ala Pro Leu Gly Asn Tyr
65 70 75 80

Thr Ile Trp Lys Asn Glu Tyr Arg Arg Gly Lys Ser Phe Glu Asp Met
85 90 95

Leu Arg Leu Met Gln Asp Gln Ile Ile Met Ala Arg Val Tyr Ser Gly
100 105 110

Leu Ala Lys Phe Thr Asn Asn Leu Ala Leu His Gln Glu Ile Glu Thr
115 120 125

Gln Leu Met Lys Leu Ala Trp Glu Glu Glu Ser Thr Asp Ile Asp Gln
130 135 140

Glu Gln Arg Val Leu Asp Ser Ile Arg Asp Met Gly Gln Ile Leu Ala
145 150 155 160

Arg Ala His Glu Gln Leu Tyr Glu Cys Lys Leu Val Thr Asn Lys Leu
165 170 175

Arg Ala Met Leu Gln Thr Val Glu Asp Glu Leu Glu Asn Glu Gln Thr
180 185 190

Tyr Ile Thr Phe Leu Thr Gln Leu Ala Ser Lys Ala Leu Pro Asp Ala
195 200 205

Ile His Cys Leu Thr Met Arg Leu Asn Leu Glu Tyr His Leu Leu Pro
210 215 220

Leu Pro Met Arg Asn Phe Pro Arg Arg Glu Asn Leu Glu Asn Pro Lys
225 230 235 240

Leu Tyr His Tyr Ala Leu Phe Ser Asp Asn Val Leu Ala Ala Ser Val
245 250 255

Val Val Asn Ser Thr Val Met Asn Ala Gln Asp Pro Ser Arg His Val
260 265 270

Phe His Leu Val Thr Asp Lys Leu Asn Phe Gly Ala Met Ser Met Trp
275 280 285

Phe Leu Leu Asn Pro Pro Gly Glu Ala Thr Ile His Val Gln Arg Phe
290 295 300

Glu Asp Phe Thr Trp Leu Asn Ser Ser Tyr Ser Pro Val Leu Ser Gln
305 310 315 320

Leu Glu Ser Ala Ala Met Lys Lys Phe Tyr Phe Lys Thr Ala Arg Ser
325 330 335

Glu Ser Val Glu Ser Gly Ser Glu Asn Leu Lys Tyr Arg Tyr Pro Lys
340 345 350

Tyr Met Ser Met Leu Asn His Leu Arg Phe Tyr Ile Pro Arg Ile Phe
355 360 365

Pro Lys Leu Glu Lys Ile Leu Phe Val Asp Asp Asp Val Val Val Gln
370 375 380

Lys Asp Leu Thr Pro Leu Trp Ser Ile Asp Leu Lys Gly Lys Val Asn
385 390 395 400

Glu Asn Phe Asp Pro Lys Phe Cys Gly Trp Ala Tyr Gly Met Asn Ile
405 410 415

Phe Asp Leu Lys Glu Trp Lys Lys Asn Asn Ile Thr Glu Thr Tyr His
420 425 430

Phe Trp Gln Asn Leu Asn Glu Asn Arg Thr Leu Trp Lys Leu Gly Thr
435 440 445

Leu Pro Pro Gly Leu Ile Thr Phe Tyr Asn Leu Thr Gln Pro Leu Gln
450 455 460

Arg Lys Trp His Leu Leu Gly Leu Gly Tyr Asp Lys Gly Ile Asp Val
465 470 475 480

Lys Lys Ile Glu Arg Ser Ala Val Ile His Tyr Asn Gly His Met Lys
485 490 495

Pro Trp Thr Glu Met Gly Ile Ser Lys Tyr Gln Pro Tyr Trp Thr Lys
500 505 510

Tyr Thr Asn Phe Asp His Pro Tyr Ile Phe Thr Cys Arg Leu Phe Glu
515 520 525 

<210> 17 

<211> 1602 

<212> DNA 

<213> Arabidopsis thaliana

<400> 17

atgcagcttc acatatcgcc tagcatgaga agcattacga tatcgagcag caatgagttt 60 

attgatttga tgaagatcaa agtcgcagct cgtcacatct cttaccgaac tctcttccac 120 

actatcttaa tcctcgcttt cttgttacct tttgttttca tcctaaccgc tgttgttacc 180 

cttgaaggtg tcaacaagtg ctcctctttt gattgtttcg ggaggcggct aggaccacgt 240 

cttcttggta ggatagatga ttcagagcag agactagtta gagattttta caaaattcta 300 

aatgaagtaa gcactcaaga aattccagat ggtttaaagc ttccagagtc ttttagtcaa 360 

ctggtttcgg atatgaagaa caaccactat gatgctaaaa catttgccct cgtatttcga 420 

gctatggtag agaagtttga aagggattta agggaatcca aatttgcaga actcatgaac 480 

aagcactttg ctgcaagttc aattccaaaa ggaattcact gtctctcttt aagactaacc 540
gatgaatatt cctccaatgc tcatgcccgg agacagcttc cttccccgga gcttctccct 600
gttctctcag acaatgctta ccaccatttt gttctagcta cagataatat cttagctgca 660
tcggttgtgg tctcatctgc tgttcaatca tcttcaaaac ccgagaaaat tgtcttccat 720 
gttatcacag acaagaaaac ctatgcgggt atgcattctt ggtttgcact caattctgtt 780 
gctcctgcga ttgttgaagt gaaaagcgtt catcagtttg attggttaac aagagagaat 840 
gttccagttc ttgaagctgt ggaaagccat aacagtatca gaaattatta ccatgggaat 900
catattgctg gtgcaaacct cagcgaaaca acccctcgaa catttgcttc gaaactgcag 960
tcaagaagtc ccaaatacat atctttgctc aaccatctta gaatatatct accagagctt 1020
tttccgaact tagacaaggt agtgttctta gatgatgata tagtgataca gaaagattta 1080
tctccgcttt gggatattga ccttaacggg aaggttaatg gagctgtgga gacttgtcga 1140 
ggagaagacg tatgggttat gtcaaagcgt cttaggaact acttcaattt ttctcacccg 1200 
ctcatcgcaa agcatttaga tcccgaagaa tgtgcttggg cttatggaat gaatatcttt 1260 
gatctacgga cttggaggaa gacaaatatc agagaaacgt atcattcttg gcttaaagag 1320 
aatctgaagt cgaatctaac aatgtggaaa cttggaacat tgcctcctgc tctaatagca 1380 
tttaaaggtc atgttcagcc aatagattcc tcttggcata tgcttggatt aggttatcag 1440 
agcaagacca acttagaaaa tgcgaagaaa gctgcagtga ttcattacaa tggccaatca 1500 
aagccgtggc ttgagatagg tttcgagcat ctcagaccat tctggacaaa atatgttaac 1560 
tactccaatg atttcattaa gaattgtcat atcttggaat ag 1602 

<210> 18 

<211> 533 

<212> PRT

<213> Arabidopsis thaliana

<400> 18 

Met Gln Leu His Ile Ser Pro Ser Met Arg Ser Ile Thr Ile Ser Ser
1 5 10 15

Ser Asn Glu Phe Ile Asp Leu Met Lys Ile Lys Val Ala Ala Arg His
20 25 30

Ile Ser Tyr Arg Thr Leu Phe His Thr Ile Leu Ile Leu Ala Phe Leu
35 40 45

Leu Pro Phe Val Phe Ile Leu Thr Ala Val Val Thr Leu Glu Gly Val
50 55 60

Asn Lys Cys Ser Ser Phe Asp Cys Phe Gly Arg Arg Leu Gly Pro Arg
65 70 75 80

Leu Leu Gly Arg Ile Asp Asp Ser Glu Gln Arg Leu Val Arg Asp Phe
85 90 95

Tyr Lys Ile Leu Asn Glu Val Ser Thr Gln Glu Ile Pro Asp Gly Leu
100 105 110

Lys Leu Pro Glu Ser Phe Ser Gln Leu Val Ser Asp Met Lys Asn Asn
115 120 125

His Tyr Asp Ala Lys Thr Phe Ala Leu Val Phe Arg Ala Met Val Glu
130 135 140

Lys Phe Glu Arg Asp Leu Arg Glu Ser Lys Phe Ala Glu Leu Met Asn
145 150 155 160

Lys His Phe Ala Ala Ser Ser Ile Pro Lys Gly Ile His Cys Leu Ser
165 170 175

Leu Arg Leu Thr Asp Glu Tyr Ser Ser Asn Ala His Ala Arg Arg Gln
180 185 190

Leu Pro Ser Pro Glu Leu Leu Pro Val Leu Ser Asp Asn Ala Tyr His
195 200 205

His Phe Val Leu Ala Thr Asp Asn Ile Leu Ala Ala Ser Val Val Val
210 215 220

Ser Ser Ala Val Gln Ser Ser Ser Lys Pro Glu Lys Ile Val Phe His
225 230 235 240

Val Ile Thr Asp Lys Lys Thr Tyr Ala Gly Met His Ser Trp Phe Ala
245 250 255

Leu Asn Ser Val Ala Pro Ala Ile Val Glu Val Lys Ser Val His Gln
260 265 270

Phe Asp Trp Leu Thr Arg Glu Asn Val Pro Val Leu Glu Ala Val Glu
275 280 285

Ser His Asn Ser Ile Arg Asn Tyr Tyr His Gly Asn His Ile Ala Gly
290 295 300

Ala Asn Leu Ser Glu Thr Thr Pro Arg Thr Phe Ala Ser Lys Leu Gln
305 310 315 320

Ser Arg Ser Pro Lys Tyr Ile Ser Leu Leu Asn His Leu Arg Ile Tyr
325 330 335

Leu Pro Glu Leu Phe Pro Asn Leu Asp Lys Val Val Phe Leu Asp Asp
340 345 350

Asp Ile Val Ile Gln Lys Asp Leu Ser Pro Leu Trp Asp Ile Asp Leu
355 360 365

Asn Gly Lys Val Asn Gly Ala Val Glu Thr Cys Arg Gly Glu Asp Val
370 375 380

Trp Val Met Ser Lys Arg Leu Arg Asn Tyr Phe Asn Phe Ser His Pro
385 390 395 400

Leu Ile Ala Lys His Leu Asp Pro Glu Glu Cys Ala Trp Ala Tyr Gly
405 410 415

Met Asn Ile Phe Asp Leu Arg Thr Trp Arg Lys Thr Asn Ile Arg Glu
420 425 430

Thr Tyr His Ser Trp Leu Lys Glu Asn Leu Lys Ser Asn Leu Thr Met
435 440 445

Trp Lys Leu Gly Thr Leu Pro Pro Ala Leu Ile Ala Phe Lys Gly His
450 455 460

Val Gln Pro Ile Asp Ser Ser Trp His Met Leu Gly Leu Gly Tyr Gln
465 470 475 480

Ser Lys Thr Asn Leu Glu Asn Ala Lys Lys Ala Ala Val Ile His Tyr
485 490 495

Asn Gly Gln Ser Lys Pro Trp Leu Glu Ile Gly Phe Glu His Leu Arg
500 505 510

Pro Phe Trp Thr Lys Tyr Val Asn Tyr Ser Asn Asp Phe Ile Lys Asn
515 520 525

Cys His Ile Leu Glu
530 

<210> 19 
<211> 1686 
<212> DNA 

<213> Arabidopsis thaliana
<400> 19

atggcggtgg ccttccgtgg aggccgggga ggcgtcggat ccggccaatc taccggactt 60

cgtagtttct tctcctaccg gatctttatc tccgctttgt tctcttttct cttcctcgcc 120

actttctccg tcgttcttaa ctcctctcgt catcagcctc atcaggatca tacattgccg 180 

agtatgggca acgcatatat gcagaggacg tttttggctt tgcaatcgga tccattgaaa 240 

actaggttgg atctgataca caagcaagcc attgatcatt tgacactggt gaatgcgtat 300 

gctgcttacg ctaggaagct aaagcttgat gcttctaagc agcttaagct cttcgaagat 360 

ttggctatca acttctcgga tttgcagtcg aaacctggtt tgaaatctgc tgtgtctgat 420 

aatggtaatg ctcttgagga ggattcgttt aggcagcttg agaaagaagt gaaggataag 480 

gtgaagacag cgaggatgat gatcgttgag tctaaagaga gttatgatac acagcttaaa 540 
atccagaagt tgaaagatac aatctttgct gtccaagaac agttgacaaa ggctaagaaa 600
aacggtgcgg ttgctagctt gatttcagcc aagtcggttc ctaaaagtct tcattgtttg 660

gccatgaggc ttgtaggaga gaggatctct aatcctgaga agtacaagga tgctccacct 720 

gacccagccg cagaggatcc aactctttac cactatgcga ttttctctga taatgtcatt 780 

gctgtgtctg ttgtggtgag atcggttgtg atgaacgctg aggagccatg gaagcatgtc 840 
ttccatgtgg tgacagatcg gatgaatctc gcagccatga aggtgtggtt taagatgcgt 900
cctttggacc gtggtgccca tgttgagatt aaatccgtgg aggatttcaa gttcttaaac 960

tcttcctatg cgccggtctt gaggcagctt gagtctgcca agttgcagaa gttttacttt 1020 

gagaatcaag ctgagaacgc aactaaagat tcacataacc tcaagttcaa gaaccccaag 1080 

tatctctcga tgttgaacca tctcagattt tacttaccag agatgtatcc gaagctgaat 1140 

aagattttgt tcttggacga tgatgttgtg gtgcagaaag acgtgactgg tttatggaaa 1200 

atcaacttgg atggcaaggt gaatggagcc gttgagacat gttttggttc ttttcatcga 1260 

tatggtcaat acttaaactt ctctcatcct ttgatcaaag agaactttaa ccccagtgcc 1320 

tgtgcttggg cctttggaat gaacatattc gatctcaatg cctggagacg cgagaagtgc 1380 

accgatcaat accattactg gcagaacctg aatgaagaca gaactctctg gaaattggga 1440 

actctacctc cgggattgat cacattctat tcaaagacga aatcattgga caaatcatgg 1500 

catgtacttg ggttaggcta taacccggga gtgagcatgg acgaaatcag aaatgcagga 1560 

gtgattcatt acaatggaaa catgaaaccg tggctagaca ttgcgatgaa ccaatacaag 1620 

tctctctgga ctaaatatgt tgataacgaa atggagtttg tgcagatgtg caattttggt 1680 

ctctaa 1686

<210> 20 
<211> 561 
<212> PRT 

<213> Arabidopsis thaliana
<400> 20

Met Ala Val Ala Phe Arg Gly Gly Arg Gly Gly Val Gly Ser Gly Gln
1 5 10 15

Ser Thr Gly Leu Arg Ser Phe Phe Ser Tyr Arg Ile Phe Ile Ser Ala
20 25 30

Leu Phe Ser Phe Leu Phe Leu Ala Thr Phe Ser Val Val Leu Asn Ser
35 40 45

Ser Arg His Gln Pro His Gln Asp His Thr Leu Pro Ser Met Gly Asn
50 55 60

Ala Tyr Met Gln Arg Thr Phe Leu Ala Leu Gln Ser Asp Pro Leu Lys
65 70 75 80

Thr Arg Leu Asp Leu Ile His Lys Gln Ala Ile Asp His Leu Thr Leu
85 90 95

Val Asn Ala Tyr Ala Ala Tyr Ala Arg Lys Leu Lys Leu Asp Ala Ser
100 105 110

Lys Gln Leu Lys Leu Phe Glu Asp Leu Ala Ile Asn Phe Ser Asp Leu
115 120 125 

Gln Ser Lys Pro Gly Leu Lys Ser Ala Val Ser Asp Asn Gly Asn Ala
130 135 140

Leu Glu Glu Asp Ser Phe Arg Gln Leu Glu Lys Glu Val Lys Asp Lys
145 150 155 160

Val Lys Thr Ala Arg Met Met Ile Val Glu Ser Lys Glu Ser Tyr Asp
165 170 175

Thr Gln Leu Lys Ile Gln Lys Leu Lys Asp Thr Ile Phe Ala Val Gln
180 185 190

Glu Gln Leu Thr Lys Ala Lys Lys Asn Gly Ala Val Ala Ser Leu Ile
195 200 205

Ser Ala Lys Ser Val Pro Lys Ser Leu His Cys Leu Ala Met Arg Leu
210 215 220

Val Gly Glu Arg Ile Ser Asn Pro Glu Lys Tyr Lys Asp Ala Pro Pro
225 230 235 240

Asp Pro Ala Ala Glu Asp Pro Thr Leu Tyr His Tyr Ala Ile Phe Ser
245 250 255

Asp Asn Val Ile Ala Val Ser Val Val Val Arg Ser Val Val Met Asn
260 265 270

Ala Glu Glu Pro Trp Lys His Val Phe His Val Val Thr Asp Arg Met
275 280 285

Asn Leu Ala Ala Met Lys Val Trp Phe Lys Met Arg Pro Leu Asp Arg
290 295 300

Gly Ala His Val Glu Ile Lys Ser Val Glu Asp Phe Lys Phe Leu Asn
305 310 315 320

Ser Ser Tyr Ala Pro Val Leu Arg Gln Leu Glu Ser Ala Lys Leu Gln
325 330 335

Lys Phe Tyr Phe Glu Asn Gln Ala Glu Asn Ala Thr Lys Asp Ser His
340 345 350

Asn Leu Lys Phe Lys Asn Pro Lys Tyr Leu Ser Met Leu Asn His Leu
355 360 365

Arg Phe Tyr Leu Pro Glu Met Tyr Pro Lys Leu Asn Lys Ile Leu Phe
370 375 380

Leu Asp Asp Asp Val Val Val Gln Lys Asp Val Thr Gly Leu Trp Lys
385 390 395 400

Ile Asn Leu Asp Gly Lys Val Asn Gly Ala Val Glu Thr Cys Phe Gly
405 410 415

Ser Phe His Arg Tyr Gly Gln Tyr Leu Asn Phe Ser His Pro Leu Ile
420 425 430

Lys Glu Asn Phe Asn Pro Ser Ala Cys Ala Trp Ala Phe Gly Met Asn
435 440 445

Ile Phe Asp Leu Asn Ala Trp Arg Arg Glu Lys Cys Thr Asp Gln Tyr
450 455 460

His Tyr Trp Gln Asn Leu Asn Glu Asp Arg Thr Leu Trp Lys Leu Gly
465 470 475 480

Thr Leu Pro Pro Gly Leu Ile Thr Phe Tyr Ser Lys Thr Lys Ser Leu
485 490 495

Asp Lys Ser Trp His Val Leu Gly Leu Gly Tyr Asn Pro Gly Val Ser
500 505 510

Met Asp Glu Ile Arg Asn Ala Gly Val Ile His Tyr Asn Gly Asn Met
515 520 525

Lys Pro Trp Leu Asp Ile Ala Met Asn Gln Tyr Lys Ser Leu Trp Thr
530 535 540

Lys Tyr Val Asp Asn Glu Met Glu Phe Val Gln Met Cys Asn Phe Gly
545 550 555 560 

Leu 

<210> 21 

<211> 1680 

<212> DNA 

<213> Arabidopsis thaliana

<400> 21

atggctaatc accaccgact tttacgcggc ggcggatctc cggccataat cggtggcaga 60 

atcacactca cagctttcgc ttccactatc gcactcttcc tcttcactct ctccttcttc 120 

ttcgcttcag attctaacga ttctcctgat ctccttcttc ccggtgttga gtactctaat 180 

ggagtcggat ctagaagatc catgttggat atcaaatcgg atccgcttaa gccacggttg 240 

attcagatcc ggaaacaagc tgatgatcat cggtcattag cattagctta tgcttcttac 300 

gcgagaaagc ttaagctcga gaattcgaaa ctcgtcagga tcttcgctga tctttcgagg 360 

aattacacgg atctgattaa caaaccgacg tatcgagctt tgtatgattc tgatggagcc 420 

tcgattgaag aatctgtgct taggcaattt gagaaagaag ttaaggaacg gattaaaatg 480 

actcgtcaag tgattgctga agctaaagag tcttttgata atcagttgaa gattcagaag 540 

ctgaaagata cgattttcgc tgttaacgaa cagttaacta atgctaagaa gcaaggtgcg 600 

ttttcgagtt tgatcgctgc gaaatcgatt ccgaaaggat tgcattgtct tgctatgagg 660 

ctgatggaag agaggattgc tcaccctgag aagtatactg atgaagggaa agatagaccg 720 

cgggagctcg aggatccgaa tctttaccat tacgctatat tttcggataa tgtgattgcg 780 

gcttcggtgg ttgtgaactc tgctgtgaag aatgctaagg agccgtggaa gcatgttttt 840 

cacgttgtga ctgataagat gaatcttgga gctatgcagg ttatgtttaa actgaaggag 900 

tataaaggag ctcatgtaga agttaaagct gttgaggatt atacgttttt gaactcttcg 960 

tatgtgcctg tgttgaagca gttagaatct gcgaatcttc agaagtttta tttcgagaat 1020 

aagctcgaga atgcgacgaa agataccacg aatatgaagt tcaggaaccc caagtattta 1080 

tctatattga atcacttgag gttttattta cccgagatgt acccgaaact acataggata 1140

ctgtttttgg acgatgatgt ggttgtgcag aaggatttaa cgggtctgtg ggagattgat 1200

atggatggga aagtgaatgg agctgtagag acttgttttg ggtcgtttca tcggtacgct 1260 

caatacatga atttctcaca tcctttgatc aaagagaagt ttaatcccaa agcatgtgcg 1320 

tgggcgtatg gaatgaactt ctttgatctt gatgcttgga gaagagagaa gtgcacagaa 1380 

gaatatcact actggcaaaa tctgaacgag aacagggctc tatggaaact ggggacgtta 1440 

ccaccgggac tgatcacctt ttactcaacc acaaagccgc tggacaaatc atggcatgtg 1500 

cttgggctgg gttacaatcc gagcattagc atggatgaga tccgcaacgc tgcagtggta 1560 

cacttcaacg gtaacatgaa gccatggctt gacatagcta tgaaccagtt tcgaccactt 1620 

tggaccaaac acgtcgacta tgacctcgag tttgttcagg cttgcaattt tggcctctga 1680 

<210> 22 

<211> 559 

<212> PRT 

<213> Arabidopsis thaliana

<400> 22 

Met Ala Asn His His Arg Leu Leu Arg Gly Gly Gly Ser Pro Ala Ile
1 5 10 15

Ile Gly Gly Arg Ile Thr Leu Thr Ala Phe Ala Ser Thr Ile Ala Leu
20 25 30

Phe Leu Phe Thr Leu Ser Phe Phe Phe Ala Ser Asp Ser Asn Asp Ser
35 40 45

Pro Asp Leu Leu Leu Pro Gly Val Glu Tyr Ser Asn Gly Val Gly Ser
50 55 60

Arg Arg Ser Met Leu Asp Ile Lys Ser Asp Pro Leu Lys Pro Arg Leu
65 70 75 80

Ile Gln Ile Arg Lys Gln Ala Asp Asp His Arg Ser Leu Ala Leu Ala
85 90 95

Tyr Ala Ser Tyr Ala Arg Lys Leu Lys Leu Glu Asn Ser Lys Leu Val
100 105 110

Arg Ile Phe Ala Asp Leu Ser Arg Asn Tyr Thr Asp Leu Ile Asn Lys
115 120 125

Pro Thr Tyr Arg Ala Leu Tyr Asp Ser Asp Gly Ala Ser Ile Glu Glu
130 135 140



Ser Val Leu Arg Gln Phe Glu Lys Glu Val Lys Glu Arg Ile Lys Met
145 150 155 160

Thr Arg Gln Val Ile Ala Glu Ala Lys Glu Ser Phe Asp Asn Gln Leu
165 170 175

Lys Ile Gln Lys Leu Lys Asp Thr Ile Phe Ala Val Asn Glu Gln Leu
180 185 190

Thr Asn Ala Lys Lys Gln Gly Ala Phe Ser Ser Leu Ile Ala Ala Lys
195 200 205

Ser Ile Pro Lys Gly Leu His Cys Leu Ala Met Arg Leu Met Glu Glu
210 215 220

Arg Ile Ala His Pro Glu Lys Tyr Thr Asp Glu Gly Lys Asp Arg Pro
225 230 235 240

Arg Glu Leu Glu Asp Pro Asn Leu Tyr His Tyr Ala Ile Phe Ser Asp
245 250 255

Asn Val Ile Ala Ala Ser Val Val Val Asn Ser Ala Val Lys Asn Ala
260 265 270

Lys Glu Pro Trp Lys His Val Phe His Val Val Thr Asp Lys Met Asn
275 280 285

Leu Gly Ala Met Gln Val Met Phe Lys Leu Lys Glu Tyr Lys Gly Ala
290 295 300

His Val Glu Val Lys Ala Val Glu Asp Tyr Thr Phe Leu Asn Ser Ser
305 310 315 320

Tyr Val Pro Val Leu Lys Gln Leu Glu Ser Ala Asn Leu Gln Lys Phe
325 330 335

Tyr Phe Glu Asn Lys Leu Glu Asn Ala Thr Lys Asp Thr Thr Asn Met
340 345 350

Lys Phe Arg Asn Pro Lys Tyr Leu Ser Ile Leu Asn His Leu Arg Phe
355 360 365

Tyr Leu Pro Glu Met Tyr Pro Lys Leu His Arg Ile Leu Phe Leu Asp
370 375 380

Asp Asp Val Val Val Gln Lys Asp Leu Thr Gly Leu Trp Glu Ile Asp
385 390 395 400

Met Asp Gly Lys Val Asn Gly Ala Val Glu Thr Cys Phe Gly Ser Phe
405 410 415

His Arg Tyr Ala Gln Tyr Met Asn Phe Ser His Pro Leu Ile Lys Glu
420 425 430

Lys Phe Asn Pro Lys Ala Cys Ala Trp Ala Tyr Gly Met Asn Phe Phe
435 440 445

Asp Leu Asp Ala Trp Arg Arg Glu Lys Cys Thr Glu Glu Tyr His Tyr
450 455 460

Trp Gln Asn Leu Asn Glu Asn Arg Ala Leu Trp Lys Leu Gly Thr Leu
465 470 475 480

Pro Pro Gly Leu Ile Thr Phe Tyr Ser Thr Thr Lys Pro Leu Asp Lys
485 490 495

Ser Trp His Val Leu Gly Leu Gly Tyr Asn Pro Ser Ile Ser Met Asp
500 505 510

Glu Ile Arg Asn Ala Ala Val Val His Phe Asn Gly Asn Met Lys Pro
515 520 525

Trp Leu Asp Ile Ala Met Asn Gln Phe Arg Pro Leu Trp Thr Lys His
530 535 540

Val Asp Tyr Asp Leu Glu Phe Val Gln Ala Cys Asn Phe Gly Leu
545 550 555 

<210> 23 
<211> 1623 
<212> DNA 

<213> Arabidopsis thaliana
<400> 23

atgaagtttt acatatcagc gacggggatt aagaaggtta cgatatcaaa tcccggcgtc 60 

ggaatcggta aaggaagcgg aggatgtgcg gctgcagcgg cggcgttagc agcgcggaga 120 

ttctctagtc gcacgttgtt actgttgctg ctgctgctcg ctatcgtcct cccttttatc 180 

ttcgtcaggt tcgcgtttct cgtcctcgaa tctgcctccg tttgcgattc accactcgat 240 

tgcatgggac tcagactttt ccgtgggggc gacacatctc tgaaaattgg ggaagagttg 300 

acacgggctc tagtggaaga gacgacagat catcaggacg ttaatggaag aggaacgaag 360 

ggatcattgg agtcattcga cgaccttgtt aaggagatga cgttaaaacg ccgtgacata 420 

agggcgtttg cttccgtgac taagaagatg ctgttgcaga tggaacgtaa agtccaatca 480 

gcgaaacatc atgagttagt gtactggcat ttagcctctc acggtattcc taaaagcctc 540
cattgccttt ccctcagatt aactgaagag tactctgtaa atgcaatggc tcgaatgcgt 600
ttgcctccgc ctgagtccgt atcacgtctg accgacccat cttttcatca tattgtcctc 660 
ctgactgaca atgtccttgc tgcctctgtc gtcatatcgt ctactgtaca aaacgctgtg 720 
aatcccgaga agtttgtctt tcatattgtt accgataaga aaacctatac ccctatgcat 780 
gcttggtttg ctatcaactc tgcttcatca ccagttgttg aagtaaaggg acttcatcag 840 
tatgattggc ctcaagaagt gaacttcaaa gttagagaga tgctggacat tcaccgctta 900 
atttggagac gacattatca aaatttgaaa gactctgatt ttagttttgt tgagggtact 960
catgagcagt ccttgcaagc tctaaatcct agctgccttg cccttttgaa ccatcttcgc 1020
atttacattc ccaagctttt tccagatctc aacaagatag tgttgttgga tgatgatgta 1080
gtagtacaga gcgatctttc gtctttatgg gaaacggatc tcaacggtaa agttgttggt 1140 
gctgtcgttg attcgtggtg cggagacaac tgttgccccg gaagaaaata caaagactat 1200 
ttcaacttct cacatccttt gatctcatca aacttagttc aagaagactg tgcttggctt 1260 
tctggtatga atgtctttga tctcaaagcc tggagacaaa ccaatattac tgaagcttac 1320 
tctacatggc taagactcag tgttaggtca ggactacaat tatggcaacc aggggcttta 1380 
ccaccgacat tacttgcttt caaaggactt acacagtctc ttgaaccatc atggcacgtc 1440 
gctggactag gttctcgatc cgtaaaatcc cctcaagaga ttctgaaatc tgcttcggtt 1500 
ttacatttca gcggtccagc aaaaccgtgg ctagagatca gtaaccctga ggtacgatct 1560 
ctttggtata gatacgtaaa ttcctccgac atcttcgtta gaaaatgcaa aatcatgaac 1620 
tga 1623

<210> 24 
<211> 540 
<212> PRT 

<213> Arabidopsis thaliana
<400> 24 

Met Lys Phe Tyr Ile Ser Ala Thr Gly Ile Lys Lys Val Thr Ile Ser
1 5 10 15

Asn Pro Gly Val Gly Ile Gly Lys Gly Ser Gly Gly Cys Ala Ala Ala
20 25 30

Ala Ala Ala Leu Ala Ala Arg Arg Phe Ser Ser Arg Thr Leu Leu Leu
35 40 45

Leu Leu Leu Leu Leu Ala Ile Val Leu Pro Phe Ile Phe Val Arg Phe
50 55 60



Ala Phe Leu Val Leu Glu Ser Ala Ser Val Cys Asp Ser Pro Leu Asp
65 70 75 80 



Cys Met Gly Leu Arg Leu Phe Arg Gly Gly Asp Thr Ser Leu Lys Ile
85 90 95

Gly Glu Glu Leu Thr Arg Ala Leu Val Glu Glu Thr Thr Asp His Gln
100 105 110

Asp Val Asn Gly Arg Gly Thr Lys Gly Ser Leu Glu Ser Phe Asp Asp
115 120 125

Leu Val Lys Glu Met Thr Leu Lys Arg Arg Asp Ile Arg Ala Phe Ala
130 135 140

Ser Val Thr Lys Lys Met Leu Leu Gln Met Glu Arg Lys Val Gln Ser
145 150 155 160

Ala Lys His His Glu Leu Val Tyr Trp His Leu Ala Ser His Gly Ile
165 170 175

Pro Lys Ser Leu His Cys Leu Ser Leu Arg Leu Thr Glu Glu Tyr Ser
180 185 190

Val Asn Ala Met Ala Arg Met Arg Leu Pro Pro Pro Glu Ser Val Ser
195 200 205

Arg Leu Thr Asp Pro Ser Phe His His Ile Val Leu Leu Thr Asp Asn
210 215 220

Val Leu Ala Ala Ser Val Val Ile Ser Ser Thr Val Gln Asn Ala Val
225 230 235 240

Asn Pro Glu Lys Phe Val Phe His Ile Val Thr Asp Lys Lys Thr Tyr
245 250 255

Thr Pro Met His Ala Trp Phe Ala Ile Asn Ser Ala Ser Ser Pro Val
260 265 270

Val Glu Val Lys Gly Leu His Gln Tyr Asp Trp Pro Gln Glu Val Asn
275 280 285

Phe Lys Val Arg Glu Met Leu Asp Ile His Arg Leu Ile Trp Arg Arg
290 295 300

His Tyr Gln Asn Leu Lys Asp Ser Asp Phe Ser Phe Val Glu Gly Thr
305 310 315 320


His Glu Gln Ser Leu Gln Ala Leu Asn Pro Ser Cys Leu Ala Leu Leu
325 330 335

Asn His Leu Arg Ile Tyr Ile Pro Lys Leu Phe Pro Asp Leu Asn Lys
340 345 350

Ile Val Leu Leu Asp Asp Asp Val Val Val Gln Ser Asp Leu Ser Ser
355 360 365

Leu Trp Glu Thr Asp Leu Asn Gly Lys Val Val Gly Ala Val Val Asp
370 375 380

Ser Trp Cys Gly Asp Asn Cys Cys Pro Gly Arg Lys Tyr Lys Asp Tyr
385 390 395 400

Phe Asn Phe Ser His Pro Leu Ile Ser Ser Asn Leu Val Gln Glu Asp
405 410 415

Cys Ala Trp Leu Ser Gly Met Asn Val Phe Asp Leu Lys Ala Trp Arg
420 425 430

Gln Thr Asn Ile Thr Glu Ala Tyr Ser Thr Trp Leu Arg Leu Ser Val
435 440 445

Arg Ser Gly Leu Gln Leu Trp Gln Pro Gly Ala Leu Pro Pro Thr Leu
450 455 460

Leu Ala Phe Lys Gly Leu Thr Gln Ser Leu Glu Pro Ser Trp His Val
465 470 475 480

Ala Gly Leu Gly Ser Arg Ser Val Lys Ser Pro Gln Glu Ile Leu Lys
485 490 495

Ser Ala Ser Val Leu His Phe Ser Gly Pro Ala Lys Pro Trp Leu Glu
500 505 510

Ile Ser Asn Pro Glu Val Arg Ser Leu Trp Tyr Arg Tyr Val Asn Ser
515 520 525

Ser Asp Ile Phe Val Arg Lys Cys Lys Ile Met Asn
530 535 540 

<210> 25 
<211> 2043 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 25 
atgacgacgt tctctacatg cgccgccttt ttatcgctgg tagtagtgct acatgctgtt 60 

catgtcggtg gagccatttt agagtcacaa gcaccccaca gagaacttaa agcttatcgt 120 

ccgctgcaag ataataatct acaggaggtg tatgcttcct cagctgctgc agtgcactac 180 

gatccagatc tgaaagatgt gaacatagtt gcgacataca gtgaccatta cggcaatata 240 

cgccttggta gggtgaaaat gggggatctt tcaccttctt gggttttgga gaatcctgcc 300 

tatcaagtta gccgcaaaac aaaaggttcg cagctagtta taccacggga ttcatttcaa 360 

aatgatactg gaatggaaga taatgcaagc cattctacaa ctaatcagac tgatgaaagc 420 

gaaaatcagt ttccaaacgt ggattttgca agcccagcaa aactgaagcg gcagatttta 480 

cgtcaggaaa ggagaggtca acgaacttta gagctgatcc gacaagaaaa ggaaactgat 540 

gagcagatgc aagaagcagc cattcagaag tcaatgagct ttgaaaactc agtcataggg 600 

aaatacagta tatggaggag agactatgag agcccaaatg ctgatgctat cttgaagctt 660 

atgagagacc agatcataat ggcaaaagca tatgcaaata ttgccaaatc aaaaaatgta 720 

accaatctgt acgttttctt gatgcagcag tgtggagaaa ataaacgtgt tataggtaaa 780 

gcaacctctg atgctgacct tccttcaagc gctcttgatc aagcaaaagc catgggccat 840 

gcactctctc ttgcaaaaga cgagttatat gactgccatg aacttgcaaa aaagttccgg 900 

gccatccttc agtccactga acgcaaagta gatggactga agaaaaaggg aaccttctta 960 

attcagctag ctgccaaaac atttcccaag ccattgcatt gcctgagtct gcagctagcg 1020 

gcagactatt ttattctagg tttcaatgaa gaggatgcag tgaaagagga tgtcagtcaa 1080 

aagaagcttg aagatccttc gctctatcac tatgcgatct tttcggataa cgttctggct 1140 

acatcagtgg tggtgaactc cactgtcttg aatgcaaagg aaccgcagag gcatgtgttc 1200 

catatagtaa ctgacaaact gaattttggt gcaatgaaga tgtggtttcg catcaatgct 1260 

cctgctgatg cgacgattca agttgaaaac ataaatgatt tcaagtggct gaactcctct 1320 

tactgctctg ttctacggca gcttgaatct gcaaggctga aagaatacta tttcaaagca 1380 

aatcatcctt catcaatctc agctggcgca gataatctaa agtaccgcaa cccaaagtat 1440 

ctatcgatgc tgaatcatct cagattctac cttcctgagg tttatccgaa gctggagaag 1500 

attctgtttc tagacgatga cattgtggtg cagaaggacc tggcaccact atgggaaata 1560 

gacatgcaag gaaaagtgaa tggtgcggtg gagacgtgca aggagagctt ccacagattt 1620 

gacaagtacc tcaacttctc aaatccaaag atttcagaga attttgacgc tggtgcttgt 1680 

gggtgggcat ttgggatgaa tatgtttgac ctgaaagagt ggaggaaacg gaacattaca 1740 

gggatatatc actattggca agacttgaat gaagacagaa cactgtggaa gctgggatcg 1800 

ttgccaccgg ggctgataac attttacaac ctgacgtatg caatggatag gagctggcac 1860 

gtactagggc tgggatatga cccagcgcta aaccaaacag caatagagaa tgcagcggta 1920 
gtgcattaca atgggaacta caagccatgg ctgggtttag cattcgccaa gtacaaaccg 1980 
tactggtcca agtacgttga gtacgacaac ccttatctcc gacggtgcga catcaatgaa 2040 
tga 2043 

<210> 26 
<211> 680 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 26 

Met Thr Thr Phe Ser Thr Cys Ala Ala Phe Leu Ser Leu Val Val Val 
1 5 10 15 

Leu His Ala Val His Val Gly Gly Ala lie Leu Glu Ser Gin Ala Pro 
20 25 30 

His Arg Glu Leu Lys Ala Tyr Arg Pro Leu Gin Asp Asn Asn Leu Gin 
35 40 45 

Glu Val Tyr Ala ser ser Ala Ala Ala Val His Tyr Asp Pro Asp Leu 
50 55 60 

Lys Asp val Asn lie val Ala Thr Tyr Ser Asp His Tyr Gly Asn lie 
65 70 75 80 

Arg Leu Gly Arg val Lys Met Gly Asp Leu Ser Pro ser Trp val Leu 
85 90 95 

Glu Asn Pro Ala Tyr Gin Val Ser Arg Lys Thr Lys Gly Ser Gin Leu 
100 105 110 

Val He Pro Arg Asp ser Phe Gin Asn Asp Thr Gly Met Glu Asp Asn 
115 120 125 

Ala ser His ser Thr Thr Asn Gin Thr Asp Glu Ser Glu Asn Gin Phe 
130 135 140 

pro Asn val Asp Phe Ala Ser Pro Ala Lys Leu Lys Arg Gin He Leu 
145 150 155 160 

Arg Gin Glu Arg Arg Gly Gin Arg Thr Leu Glu Leu lie Arg Gin Glu 
165 170 175 

Lys Glu Thr Asp Glu Gln Met Gln Glu Ala Ala Ile Gln Lys Ser Met 
180 185 190 

Ser Phe Glu Asn Ser Val Ile Gly Lys Tyr Ser Ile Trp Arg Arg Asp 
195 200 205 

Tyr Glu ser Pro Asn Ala Asp Ala lie Leu Lys Leu Met Arg Asp Gin 
210 215 220 

lie lie Met Ala Lys Ala Tyr Ala Asn He Ala Lys ser Lys Asn val 
225 230 235 240 

Thr Asn Leu Tyr val Phe Leu Met Gin Gin cys Gly Glu Asn Lys Arg 
245 250 255 

val lie Gly Lys Ala Thr ser Asp Ala Asp Leu Pro Ser Ser Ala Leu 
260 265 270 

Asp Gin Ala Lys Ala Met Gly His Ala Leu Ser Leu Ala Lys Asp Glu 
275 280 285 

Leu Tyr Asp cys His Glu Leu Ala Lys Lys Phe Arg Ala lie Leu Gin 
290 295 300 

Ser Thr Glu Arg Lys val Asp Gly Leu Lys Lys Lys Gly Thr Phe Leu 
305 310 315 320 

lie Gin Leu Ala Ala Lys Thr Phe Pro Lys Pro Leu His cys Leu Ser 
325 330 335 

Leu Gin Leu Ala Ala Asp Tyr Phe lie Leu Gly Phe Asn Glu Glu Asp 
340 345 350 

Ala Val Lys Glu Asp val Ser Gin Lys Lys Leu Glu Asp Pro ser Leu 
355 360 365 

Tyr His Tyr Ala lie Phe Ser Asp Asn val Leu Ala Thr ser val val 
370 375 380 

Val Asn Ser Thr val Leu Asn Ala Lys Glu Pro Gin Arg His Val Phe 
385 390 395 400 

His lie Val Thr Asp Lys Leu Asn Phe Gly Ala Met Lys Met Trp Phe 
405 410 415 

Arg lie Asn Ala Pro Ala Asp Ala Thr lie Gin val Glu Asn lie Asn 
420 425 430 

Asp Phe Lys Trp Leu Asn Ser ser Tyr cys Ser Val Leu Arg Gin Leu 
435 440 445 
Glu Ser Ala Arg Leu Lys Glu Tyr Tyr Phe Lys Ala Asn His Pro Ser 
450 455 460 

ser He Ser Ala Gly Ala Asp Asn Leu Lys Tyr Arg Asn Pro Lys Tyr 
465 470 475 480 

Leu ser Met Leu Asn His Leu Arg Phe Tyr Leu Pro Glu Val Tyr Pro 
485 490 495 

Lys Leu Glu Lys lie Leu Phe Leu Asp Asp Asp He val Val Gin Lys 
500 505 510 

Asp Leu Ala Pro Leu Trp Glu lie Asp Met Gin Gly Lys Val Asn Gly 
515 520 525 

Ala val Glu Thr Cys Lys Glu ser Phe His Arg Phe Asp Lys Tyr Leu 
530 535 540 

Asn Phe ser Asn Pro Lys lie ser Glu Asn Phe Asp Ala Gly Ala Cys 
545 550 555 560 

Gly Trp Ala Phe Gly Met Asn Met Phe Asp Leu Lys Glu Trp Arg Lys 
565 570 575 

Arg Asn He Thr Gly He Tyr His Tyr Trp Gin Asp Leu Asn Glu Asp 
580 585 590 

Arg Thr Leu Trp Lys Leu Gly ser Leu Pro Pro Gly Leu lie Thr Phe 
595 600 605 

Tyr Asn Leu Thr Tyr Ala Met Asp Arg Ser Trp His Val Leu Gly Leu 
610 615 620 

Gly Tyr Asp Pro Ala Leu Asn Gin Thr Ala He Glu Asn Ala Ala Val 
625 630 635 640 

val His Tyr Asn Gly Asn Tyr Lys Pro Trp Leu Gly Leu Ala Phe Ala 
645 650 655 

Lys Tyr Lys Pro Tyr Trp ser Lys Tyr Val Glu Tyr Asp Asn Pro Tyr 
660 665 670 

Leu Arg Arg cys Asp He Asn Glu 
675 680 

<210> 27 
<211> 1599 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 27 

atgcagcttc acatatcgcc gagtatgaga agcattacga tttcgagcag caatgagttt 60 
attgacttga tgaagatcaa ggtcgcagct cgtcacatct cttaccgaac tctcttccac 120 
accatcttaa tcctcgcttt cttgttgcct tttgttttca ttctcaccgc tgttgttacc 180 
cttgagggtg tcaacaaatg ctcctccatt gattgtttag ggaggcggat aggtccacgt 240 
cttcttggta gggtagatga ttcagagaga ctagctagag acttttataa aattctaaac 300 
gaagtaagca ctcaagaaat tccagatggt ttgaagcttc caaattcttt tagtcaactt 360 
gtttccgata tgaagaataa ccactatgat gcaaaaacat ttgctcttgt gctgcgagcc 420 
atgatggaga agtttgaacg tgatatgagg gaatcgaaat ttgcagaact tatgaacaag 480 
cactttgcag caagttccat tcccaaaggc attcattgtc tctctctaag actgacagat 540 
gaatattcct ccaatgctca tgctcgaaga cagcttcctt caccagagtt tctccctgtt 600 
ctttcagata atgcttacca ccactttatt ttgtccacgg acaatatttt ggctgcctca 660 
gttgtggtct catccgctgt tcagtcatct tcaaaacccg agaaaattgt ctttcacatc 720 
attacagaca agaaaaccta tgcgggtatg cattcatggt ttgcgcttaa ttctgttgca 780 
ccagcaattg ttgaggttaa aggtgttcat cagtttgact ggttgacgag agagaatgtt 840 
ccggttttgg aagctgtgga aagccataat ggtgtcaggg actattatca tgggaatcat 900 
gtcgctgggg caaacctcac cgaaacaact cctcgaacat ttgcttcaaa attgcagtct 960 
agaagtccaa aatacatatc tttgctcaac catcttagaa tatatatacc agagcttttc 1020 
ccgaacttgg acaaggtggt tttcttagac gatgatatag ttgtccaggg agacttaact 1080 
ccactttggg atgttgacct cggtggtaag gtcaatgggg cagtagagac ttgcaggggt 1140 
gaagatgaat gggtgatgtc aaagcgttta aggaactact tcaatttctc tcacccgctc 1200 
atcgcaaagc atttagatcc tgaagaatgt gcttgggcat atggtatgaa tatcttcgat 1260 
ctacaagctt ggaggaaaac aaatatcaga gaaacgtatc actcttggct tagagagaat 1320 
ctaaagtcaa atctgacaat gtggaaactt ggaaccttgc ctcctgctct tatcgcgttc 1380 
aagggtcacg tacacataat agactcgtca tggcatatgc taggattagg ctaccagagc 1440 
aagaccaaca tagaaaatgt gaagaaagca gcagtgatcc actacaatgg gcagtcaaag 1500 
ccatggctgg agattggttt cgagcatctg cggccattct ggaccaaata cgtcaactac 1560 
tcaaatgatt tcatcaagaa ctgtcacata ttggagtag 1599 

<210> 28 
<211> 532 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 28 

Met Gln Leu His Ile Ser Pro Ser Met Arg Ser Ile Thr Ile Ser Ser 
1 5 10 15 

ser Asn Glu Phe lie Asp Leu Met Lys He Lys val Ala Ala Arg His 
20 25 30 

He ser Tyr Arg Thr Leu Phe His Thr lie Leu He Leu Ala Phe Leu 
35 40 45 

Leu pro Phe val Phe lie Leu Thr Ala val val Thr Leu Glu Gly val 
50 55 60 

Asn Lys cys ser ser lie Asp cys Leu Gly Arg Arg He Gly Pro Arg 
65 70 75 80 

Leu Leu Gly Arg Val Asp Asp ser Glu Arg Leu Ala Arg Asp Phe Tyr 
85 90 95 

Lys Ile Leu Asn Glu Val Ser Thr Gln Glu Ile Pro Asp Gly Leu Lys 
100 105 110 

Leu Pro Asn ser Phe Ser Gin Leu val Ser Asp Met Lys Asn Asn His 
115 120 125 

Tyr Asp Ala Lys Thr Phe Ala Leu val Leu Arg Ala Met Met Glu Lys 
130 135 140 

Phe Glu Arg Asp Met Arg Glu Ser Lys Phe Ala Glu Leu Met Asn Lys 
145 150 155 160 

His Phe Ala Ala Ser Ser He Pro Lys Gly lie His Cys Leu ser Leu 
165 170 175 

Arg Leu Thr Asp Glu Tyr Ser Ser Asn Ala His Ala Arg Arg Gln Leu 
180 185 190 

Pro Ser Pro Glu Phe Leu Pro Val Leu Ser Asp Asn Ala Tyr His His 
195 200 205 

Phe He Leu ser Thr Asp Asn lie Leu Ala Ala Ser val val Val ser 
210 215 220 

Ser Ala Val Gln Ser Ser Ser Lys Pro Glu Lys Ile Val Phe His Ile 
225 230 235 240 

Ile Thr Asp Lys Lys Thr Tyr Ala Gly Met His Ser Trp Phe Ala Leu 
245 250 255 

Asn ser Val Ala pro Ala He Val Glu val Lys Gly val His Gin Phe 
260 265 270 

Asp Trp Leu Thr Arg Glu Asn Val Pro Val Leu Glu Ala Val Glu Ser 
275 280 285 

His Asn Gly Val Arg Asp Tyr Tyr His Gly Asn His val Ala Gly Ala 
290 295 300 

Asn Leu Thr Glu Thr Thr Pro Arg Thr Phe Ala Ser Lys Leu Gin Ser 
305 310 315 320 

Arg Ser Pro Lys Tyr Ile Ser Leu Leu Asn His Leu Arg Ile Tyr Ile 
325 330 335 

Pro Glu Leu Phe Pro Asn Leu Asp Lys Val Val Phe Leu Asp Asp Asp 
340 345 350 

lie val Val Gin Gly Asp Leu Thr Pro Leu Trp Asp Val Asp Leu Gly 
355 360 365 

Gly Lys Val Asn Gly Ala Val Glu Thr Cys Arg Gly Glu Asp Glu Trp 
370 375 380 

val Met ser Lys Arg Leu Arg Asn Tyr Phe Asn Phe Ser His Pro Leu 
385 390 395 400 

He Ala Lys His Leu Asp Pro Glu Glu cys Ala Trp Ala Tyr Gly Met 
405 410 415 

Asn He Phe Asp Leu Gin Ala Trp Arg Lys Thr Asn He Arg Glu Thr 
420 425 430 

Tyr His Ser Trp Leu Arg Glu Asn Leu Lys ser Asn Leu Thr Met Trp 
435 440 445 

Lys Leu Gly Thr Leu Pro Pro Ala Leu Ile Ala Phe Lys Gly His Val 
450 455 460 

His He He Asp ser Ser Trp His Met Leu Gly Leu Gly Tyr Gin ser 
465 470 475 480 

Lys Thr Asn Ile Glu Asn Val Lys Lys Ala Ala Val Ile His Tyr Asn 
485 490 495 

Gly Gln Ser Lys Pro Trp Leu Glu Ile Gly Phe Glu His Leu Arg Pro 
500 505 510 

Phe Trp Thr Lys Tyr val Asn Tyr ser Asn Asp Phe lie Lys Asn Cys 
515 520 525 

His He Leu Glu 
530 

<210> 29 
<211> 1608 
<212> DNA 

<213> Arabidopsis thaliana 

<400> 29 

atgcagttac atatatctcc gagcttgaga catgtgactg tggtcacagg gaaaggattg 60 

agagagttca taaaagttaa ggttggttct agaagattct cttatcaaat ggtgttttac 120 

tctctactct tcttcacttt tcttctccga ttcgtctttg ttctctccac cgttgatact 180 

atcgacggcg atccctctcc ttgctcctct cttgcttgct tggggaaaag actaaagcca 240 

aagcttttag gaagaagggt tgattctggt aatgttccag aagctatgta ccaagtttta 300 

gaacagcctt taagcgaaca agaactcaaa ggaagatcag atatacctca aacacttcaa 360 

gatttcatgt ctgaagtcaa aagaagcaaa tcagacgcaa gagaatttgc tcaaaagcta 420 

aaagaaatgg tgacattgat ggaacagaga acaagaacgg ctaagattca agagtattta 480 

tatcgacatg tcgcatcaag cagcataccg aaacaacttc actgtttagc tcttaaacta 540 

gccaacgaac actcgataaa cgcagcggcg cgtctccagc ttccagaagc tgagcttgtc 600 

cctatgttgg tagacaacaa ctactttcac tttgtcttgg cttcagacaa tattcttgca 660 

gcttcggttg tggctaagtc gttggttcaa aatgctttaa gacctcataa gatcgttctt 720 

cacatcataa cggataggaa aacttatttc ccaatgcaag cttggttctc attgcatcct 780 

ctgtctccag caataattga ggtcaaggct ttgcatcatt tcgattggtt atcgaaaggt 840 

aaagtacccg ttttggaagc tatggagaaa gatcagagag tgaggtctca attcagaggt 900 

ggatcatcgg ttattgtggc taataacaaa gagaacccgg ttgttgttgc tgctaagtta 960 

caagctctca gccctaaata caactccttg atgaatcaca tccgtattca tctaccagag 1020 

ttgtttccaa gcttaaacaa ggttgtgttt ctagacgatg acattgtgat ccaaactgat 1080 

ctttcacctc tttgggacat tgacatgaat ggaaaagtaa atggagcagt ggaaacatgt 1140 

agaggagaag acaagtttgt gatgtcaaag aagttcaaga gttacctcaa cttctcgaat 1200 

ccgacaattg ccaaaaactt caatccagag gaatgtgcat gggcttatgg aatgaatgtt 1260 

ttcgacctag cggcttggag gaggactaac ataagctcca cttactatca ttggcttgac 1320 
gagaacttaa aatcagacct gagtttgtgg cagctgggaa ctttgcctcc tgggctgatt 1380 

gctttccacg gtcatgtcca aaccatagat ccgttctggc atatgcttgg tctcggatac 1440 

caagagacca cgagctatgc cgatgctgaa agtgccgctg ttgttcattt caatggaaga 1500 

gctaagcctt ggctggatat agcatttcct catctacgtc ctctctgggc taagtatctt 1560 

gattcttctg acagatttat caagagctgt cacattagag catcatga 1608 

<210> 30 

<211> 535 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 30 

Met Gln Leu His Ile Ser Pro Ser Leu Arg His Val Thr Val Val Thr 
1 5 10 15 

Gly Lys Gly Leu Arg Glu Phe Ile Lys Val Lys Val Gly Ser Arg Arg 
20 25 30 

Phe Ser Tyr Gin Met val Phe Tyr Ser Leu Leu Phe Phe Thr Phe Leu 
35 40 45 

Leu Arg Phe Val Phe val Leu Ser Thr val Asp Thr lie Asp Gly Asp 
50 55 60 

pro Ser Pro cys Ser Ser Leu Ala Cys Leu Gly Lys Arg Leu Lys Pro 
65 70 75 80 

Lys Leu Leu Gly Arg Arg val Asp Ser Gly Asn val Pro Glu Ala Met 
85 90 95 

Tyr Gin val Leu Glu Gin Pro Leu Ser Glu Gin Glu Leu Lys Gly Arg 
100 105 110 

ser Asp lie Pro Gin Thr Leu Gin Asp Phe Met ser Glu Val Lys Arg 
115 120 125 

ser Lys Ser Asp Ala Arg Glu Phe Ala Gin Lys Leu Lys Glu Met Val 
130 135 140 

Thr Leu Met Glu Gin Arg Thr Arg Thr Ala Lys lie Gin Glu Tyr Leu 
145 150 155 160 

Tyr Arg His val Ala Ser Ser Ser lie pro Lys Gin Leu His Cys Leu 
165 170 175 

Ala Leu Lys Leu Ala Asn Glu His Ser Ile Asn Ala Ala Ala Arg Leu 
180 185 190 

Gin Leu Pro Glu Ala Glu Leu Val Pro Met Leu val Asp Asn Asn Tyr 
195 200 205 

Phe His Phe Val Leu Ala ser Asp Asn He Leu Ala Ala Ser Val Val 
210 215 220 

Ala Lys Ser Leu Val Gln Asn Ala Leu Arg Pro His Lys Ile Val Leu 
225 230 235 240 

His lie He Thr Asp Arg Lys Thr Tyr Phe Pro Met Gin Ala Trp Phe 
245 250 255 

Ser Leu His Pro Leu ser Pro Ala lie lie Glu val Lys Ala Leu His 
260 265 270 

His Phe Asp Trp Leu Ser Lys Gly Lys val Pro val Leu Glu Ala Met 
275 280 285 

Glu Lys Asp Gin Arg val Arg ser Gin Phe Arg Gly Gly ser ser Val 
290 295 300 

lie val Ala Asn Asn Lys Glu Asn Pro Val Val Val Ala Ala Lys Leu 
305 310 315 320 

Gin Ala Leu Ser Pro Lys Tyr Asn ser Leu Met Asn His lie Arg lie 
325 330 335 

His Leu Pro Glu Leu Phe Pro Ser Leu Asn Lys val Val Phe Leu Asp 
340 345 350 

Asp Asp lie val lie Gin Thr Asp Leu ser Pro Leu Trp Asp lie Asp 
355 360 365 

Met Asn Gly Lys val Asn Gly Ala val Glu Thr cys Arg Gly Glu Asp 
370 375 380 

Lys Phe val Met ser Lys Lys Phe Lys ser Tyr Leu Asn Phe ser Asn 
385 390 395 400 

Pro Thr He Ala Lys Asn Phe Asn Pro Glu Glu cys Ala Trp Ala Tyr 
405 410 415 

Gly Met Asn Val Phe Asp Leu Ala Ala Trp Arg Arg Thr Asn lie Ser 
420 425 430 
Ser Thr Tyr Tyr His Trp Leu Asp Glu Asn Leu Lys Ser Asp Leu Ser 
435 440 445 

Leu Trp Gin Leu Gly Thr Leu Pro Pro Gly Leu He Ala Phe His Gly 
450 455 460 

His Val Gln Thr Ile Asp Pro Phe Trp His Met Leu Gly Leu Gly Tyr 
465 470 475 480 

Gln Glu Thr Thr Ser Tyr Ala Asp Ala Glu Ser Ala Ala Val Val His 
485 490 495 

Phe Asn Gly Arg Ala Lys Pro Trp Leu Asp He Ala Phe Pro His Leu 
500 505 510 

Arg pro Leu Trp Ala Lys Tyr Leu Asp Ser ser Asp Arg Phe He Lys 
515 520 525 

Ser cys His lie Arg Ala Ser 
530 535 

<210> 31 
<211> 1086 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 31 

atgcattgga ttacgagatt ctctgctttc ttctccgccg cattagccat gattctcctt 60 

tctccttcgc tccaatcctt ttctccggcg gcagctatcc gatcatctca cccctacgcc 120 

gacgaattca aaccccaaca aaactccgat tactcctcct tcagagaatc tccaatgttc 180 

cgtaacgccg aacaatgcag atcttccggc gaagattccg gcgtctgtaa ccctaatctc 240 

gtccacgtag ccatcactct cgacatcgat tacctccgtg gctcaatcgc agccgtcaat 300 

tcgatcctcc agcactcaat gtgccctcaa agcgtcttct tccacttcct cgtctcctcc 360 

gagtctcaaa acctagaatc tctgattcgt tctactttcc ccaaattgac gaatctcaaa 420 

atttactatt ttgcccctga gaccgtacag tctttgattt catcttccgt gagacaagcc 480 

ctagagcaac cgttgaatta cgccagaaat tacttggcgg atctgctcga gccttgcgtt 540 

aagcgagtca tctacttgga ttcggatctc gtcgtcgtcg atgatatcgt caagctttgg 600 

aaaacgggtt taggccagag aacaatcgga gctccggagt attgtcacgc gaatttcacg 660 

aaatacttca ccggaggttt ttggtcagat aagaggttta acgggacgtt caaagggagg 720 

aacccttgtt acttcaatac tggtgtaatg gtgattgatt tgaagaagtg gagacaattt 780 

aggttcacga aacgaattga gaaatggatg gagattcaga agatagagag gatttatgag 840 

cttggttctc ttcctccgtt tcttctggta tttgctggtc atgtagctcc gatttcacat 900 
cggtggaatc aacatgggct tggtggtgat aatgttagag gtagttgccg tgatttgcat 960 
tctggtcctg tgagtttgct tcactggtca ggtagtggta agccatggtt aagactcgat 1020 
tccaagcttc catgtccttt agacacattg tgggcacctt atgatttgta taaacactcc 1080 
cattga 1086 

<210> 32 
<211> 361 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 32 

Met His Trp Ile Thr Arg Phe Ser Ala Phe Phe Ser Ala Ala Leu Ala 
1 5 10 15 

Met lie Leu Leu ser Pro ser Leu Gin ser Phe Ser Pro Ala Ala Ala 
20 25 30 

lie Arg ser Ser His Pro Tyr Ala Asp Glu Phe Lys Pro Gin Gin Asn 
35 40 45 

ser Asp Tyr ser Ser Phe Arg Glu ser Pro Met Phe Arg Asn Ala Glu 
50 55 60 

Gin cys Arg Ser ser Gly Glu Asp ser Gly Val Cys Asn Pro Asn Leu 
65 70 75 80 

val His Val Ala lie Thr Leu Asp He Asp Tyr Leu Arg Gly ser lie 
85 90 95 

Ala Ala val Asn ser lie Leu Gin His ser Met Cys Pro Gin Ser val 
100 105 110 

Phe Phe His Phe Leu val Ser ser Glu ser Gin Asn Leu Glu ser Leu 
115 120 125 

Ile Arg Ser Thr Phe Pro Lys Leu Thr Asn Leu Lys Ile Tyr Tyr Phe 
130 135 140 

Ala Pro Glu Thr Val Gin Ser Leu lie Ser Ser Ser Val Arg Gin Ala 
145 150 155 160 

Leu Glu Gin Pro Leu Asn Tyr Ala Arg Asn Tyr Leu Ala Asp Leu Leu 
165 170 175 

Glu Pro Cys val Lys Arg val lie Tyr Leu Asp Ser Asp Leu val Val 
180 185 190 
Val Asp Asp Ile Val Lys Leu Trp Lys Thr Gly Leu Gly Gln Arg Thr 
195 200 205 

lie Gly Ala Pro Glu Tyr Cys His Ala Asn Phe Thr Lys Tyr Phe Thr 
210 215 220 

Gly Gly Phe Trp Ser Asp Lys Arg Phe Asn Gly Thr Phe Lys Gly Arg 
225 230 235 240 

Asn Pro Cys Tyr Phe Asn Thr Gly val Met Val lie Asp Leu Lys Lys 
245 250 255 

Trp Arg Gin Phe Arg Phe Thr Lys Arg lie Glu Lys Trp Met Glu lie 
260 265 270 

Gin Lys lie Glu Arg lie Tyr Glu Leu Gly ser Leu Pro Pro Phe Leu 
275 280 285 

Leu val Phe Ala Gly His Val Ala Pro lie ser His Arg Trp Asn Gin 
290 295 300 

His Gly Leu Gly Gly Asp Asn Val Arg Gly Ser cys Arg Asp Leu His 
305 310 315 320 

ser Gly Pro Val Ser Leu Leu His Trp Ser Gly Ser Gly Lys Pro Trp 
325 330 335 

Leu Arg Leu Asp ser Lys Leu Pro Cys Pro Leu Asp Thr Leu Trp Ala 
340 345 350 

Pro Tyr Asp Leu Tyr Lys His ser His 
355 360 

<210> 33 
<211> 1038 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 33 

atgtcttctc tgcgtttgcg tttatgtctt cttctactct tacctatcac aattagctgc 60 

gtcacagtca ctctcactga cctccccgcg tttcgtgaag ctccggcgtt tcgaaacggc 120 

agagaatgct ccaaaacgac atggatacct tcggatcacg aacacaaccc atcaatcatc 180 

cacatcgcta tgactctcga cgcaatttac ctccgtggct cagtcgccgg cgtcttctcc 240 

gttctccaac acgcttcttg tcctgaaaac atcgttttcc acttcatcgc cactcaccgt 300 

cgcagcgccg atctccgccg cataatctcc tcaacattcc catacctaac ctaccacatt 360 
taccattttg accctaacct cgtccgcagc aaaatatctt cctctattcg tcgtgcttta 420 

gaccaaccgt taaactacgc tcggatctac ctcgccgatc tcctcccaat cgccgtccgc 480 

cgcgtaatct acttcgactc cgatctcgta gtcgtcgatg acgtggctaa actctggaga 540 

atcgatctac gtcggcacgt cgtcggagct ccggagtact gtcacgcgaa tttcactaac 600 

tacttcactt caagattctg gtcgagtcaa ggttacaaat cggcgttgaa agataggaaa 660 

ccgtgttatt tcaacaccgg agtgatggtg attgatctcg gaaaatggag agaaaggaga 720 

gtcacggtga agctagagac atggatgagg attcaaaaac gacatcgtat ttacgaattg 780 

ggatctttgc ctccgtttct gctcgttttc gccggagatg ttgagccggt ggagcatagg 840 

tggaatcagc atggtcttgg tggtgataac ttggaaggac tttgccggaa tttgcatcca 900 

ggtccggtga gtttgttgca ttggagcggg aaagggaaac catggctaag gcttgactcg 960 

agacgaccgt gtccgttgga ttcgttatgg gctccttatg atttgtttcg ttattcaccg 1020 

ttgatctctg atagctga 1038 

<210> 34 

<211> 345 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 34 

Met Ser Ser Leu Arg Leu Arg Leu Cys Leu Leu Leu Leu Leu Pro He 
1 5 10 15 

Thr He ser cys val Thr val Thr Leu Thr Asp Leu Pro Ala Phe Arg 
20 25 30 

Glu Ala Pro Ala Phe Arg Asn Gly Arg Glu cys Ser Lys Thr Thr Trp 
35 40 45 

lie Pro Ser Asp His Glu His Asn Pro Ser He lie His lie Ala Met 
50 55 60 

Thr Leu Asp Ala lie Tyr Leu Arg Gly Ser val Ala Gly val Phe ser 
65 70 75 80 

val Leu Gin His Ala ser Cys Pro Glu Asn lie val Phe His Phe lie 
85 90 95 

Ala Thr His Arg Arg Ser Ala Asp Leu Arg Arg Ile Ile Ser Ser Thr 
100 105 110 

Phe Pro Tyr Leu Thr Tyr His lie Tyr His Phe Asp Pro Asn Leu Val 
115 120 125 
Arg Ser Lys Ile Ser Ser Ser Ile Arg Arg Ala Leu Asp Gln Pro Leu 
130 135 140 

Asn Tyr Ala Arg lie Tyr Leu Ala Asp Leu Leu Pro lie Ala Val Arg 
145 150 155 160 

Arg val lie Tyr Phe Asp ser Asp Leu val val val Asp Asp Val Ala 
165 170 175 

Lys Leu Trp Arg lie Asp Leu Arg Arg His Val val Gly Ala Pro Glu 
180 185 190 

Tyr cys His Ala Asn Phe Thr Asn Tyr Phe Thr Ser Arg Phe Trp ser 
195 200 205 

ser Gin Gly Tyr Lys Ser Ala Leu Lys Asp Arg Lys Pro cys Tyr Phe 
210 215 220 

Asn Thr Gly val Met val lie Asp Leu Gly Lys Trp Arg Glu Arg Arg 
225 230 235 240 

Val Thr val Lys Leu Glu Thr Trp Met Arg lie Gin Lys Arg His Arg 
245 250 255 

lie Tyr Glu Leu Gly ser Leu Pro Pro Phe Leu Leu val Phe Ala Gly 
260 265 270 

Asp val Glu Pro Val Glu His Arg Trp Asn Gin His Gly Leu Gly Gly 
275 280 285 

Asp Asn Leu Glu Gly Leu Cys Arg Asn Leu His Pro Gly Pro val ser 
290 295 300 

Leu Leu His Trp Ser Gly Lys Gly Lys Pro Trp Leu Arg Leu Asp ser 
305 310 315 320 

Arg Arg Pro Cys Pro Leu Asp Ser Leu Trp Ala Pro Tyr Asp Leu Phe 
325 330 335 

Arg Tyr Ser Pro Leu lie Ser Asp Ser 
340 345 

<210> 35 
<211> 1056 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 35 
atgtcccaic atcttcttct tctcattctc ctctcgctac ttcttcttca taaacccatt 60 
tccgccacta caattattca aaaattcaaa gaagccccac agttttacaa ttctgcagat 120 
tgccccttaa tcgatgactc cgagtccgac gatgacgtgg tcgccaaacc aatcttctgc 180 
tcacgtcgag ctgtccacgt ggcgatgaca ctcgacgccg cctacattcg tggctcagtc 240 
gccgctgttc tctccgtcct ccaacactct tcttgtcctg aaaacattgt tttccacttc 300 
gtcgcctctg cttccgccga cgcttcttcc ttacgagcca ccatatcctc ctctttccct 360 
taccttgatt tcaccgtcta cgtcttcaac gtctcctccg tctctcgcct tatctcctcc 420 
tctatccgct ccgcactaga ctgtccttta aactacgcaa gaagctacct cgccgatctc 480 
ctccctccct gcgtccgccg cgtcgtctac ctagactccg atctgatcct cgtcgacgac 540 
atagcaaaac tcgccgccac agatctcggc cgtgattcag tcctcgccgc gccggaatac 600 
tgcaacgcca atttcacttc atacttcaca tcaaccttct ggtctaatcc gactctctct 660 
ttaaccttcg ccgatcggaa agcatgctac ttcaacactg gagtcatggt gatcgatctt 720 
tcccggtggc gcgaaggcgc gtacacgtca cgcatcgaag agtggatggc gatgcaaaag 780 
agaatgagaa tttacgagct tggttcgtta ccaccgtttt tattggtttt tgccggtttg 840 
attaaaccgg ttaatcatcg gtggaaccaa cacggtttag gaggtgataa tttcagagga 900 
ctgtgtagag atctccatcc tggtccggtg agtctgttgc attggagtgg gaaaggtaag 960 
ccatgggcta ggcttgatgc tggtcggcct tgtcctttag acgcgctttg ggctccgtat 1020 
gatcttcttc aaacgccgtt cgcgttggat tcttga 1056 

<210> 36 

<211> 351 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 36 

Met Ser Gln His Leu Leu Leu Leu Ile Leu Leu Ser Leu Leu Leu Leu 
1 5 10 15 

His Lys Pro He ser Ala Thr Thr He lie Gin Lys Phe Lys Glu Ala 
20 25 30 

pro Gin Phe Tyr Asn ser Ala Asp cys Pro Leu lie Asp Asp ser Glu 
35 40 45 

ser Asp Asp Asp val val Ala Lys Pro He Phe cys ser Arg Arg Ala 
50 55 60 

val His val Ala Met Thr Leu Asp Ala Ala Tyr lie Arg Gly Ser Val 
65 70 75 80 
Ala Ala Val Leu Ser Val Leu Gln His Ser Ser Cys Pro Glu Asn Ile 
85 90 95 

Val Phe His Phe val Ala Ser Ala ser Ala Asp Ala ser Ser Leu Arg 
100 105 110 

Ala Thr lie ser Ser Ser Phe Pro Tyr Leu Asp Phe Thr val Tyr Val 
115 120 125 

Phe Asn Val Ser Ser Val Ser Arg Leu Ile Ser Ser Ser Ile Arg Ser 
130 135 140 

Ala Leu Asp Cys Pro Leu Asn Tyr Ala Arg ser Tyr Leu Ala Asp Leu 
145 150 155 160 

Leu Pro Pro Cys val Arg Arg val val Tyr Leu Asp Ser Asp Leu He 
165 170 175 

Leu val Asp Asp lie Ala Lys Leu Ala Ala Thr Asp Leu Gly Arg Asp 
180 185 190 

ser val Leu Ala Ala Pro Glu Tyr cys Asn Ala Asn Phe Thr Ser Tyr 
195 200 205 

Phe Thr Ser Thr Phe Trp Ser Asn Pro Thr Leu Ser Leu Thr Phe Ala 
210 215 220 

Asp Arg Lys Ala Cys Tyr Phe Asn Thr Gly val Met val lie Asp Leu 
225 230 235 240 

ser Arg Trp Arg Glu Gly Ala Tyr Thr Ser Arg lie Glu Glu Trp Met 
245 250 255 

Ala Met Gin Lys Arg Met Arg lie Tyr Glu Leu Gly Ser Leu Pro Pro 
260 265 270 

Phe Leu Leu Val Phe Ala Gly Leu lie Lys Pro Val Asn His Arg Trp 
275 280 285 

Asn Gin His Gly Leu Gly Gly Asp Asn Phe Arg Gly Leu Cys Arg Asp 
290 295 300 

Leu His Pro Gly Pro val Ser Leu Leu His Trp ser Gly Lys Gly Lys 
305 310 315 320 

Pro Trp Ala Arg Leu Asp Ala Gly Arg Pro Cys Pro Leu Asp Ala Leu 
325 330 335 

Trp Ala Pro Tyr Asp Leu Leu Gin Thr Pro Phe Ala Leu Asp Ser 
340 345 350 

<210> 37 

<211> 1182 

<212> DNA 

<213> Arabidopsis thaliana 
<400> 37 

atgtcgtcgc gtttttcttt gacggtggtg tgtttgattg ctctgttacc gtttgttgtt 60 
ggtatacggt tgattccggc gaggatcacg agtgtcggtg atggcggcgg cggaggaggt 120 
aataatgggt ttagtaaact tggtccgttt atggaagctc cggagtatag aaacggcaag 180 
gagtgtgtat cttcatcagt gaacagagag aacttcgtgt cgtcttcttc tagttctaat 240 
gatccttcgc ttgttcacat cgctatgact ttggactcag agtatctccg tggatcaatc 300 
gcagccgttc attctgttct tcgccacgcg tcttgtccag agaacgtctt cttccatttc 360 
atcgctgctg agtttgactc tgcgagtcct cgtgttctga gtcaactcgt gaggtcgact 420 
tttccttcgt tgaactttaa agtctacatt tttagggaag atacggtgat caatctcata 480 
tcttcttcga ttagactagc tttggagaat ccgttgaact atgctcggaa ctatctcgga 540 
gatattcttg atcgaagtgt tgaacgagtc atttatcttg actcggatgt tataactgtg 600 
gatgatatca caaagctttg gaacacggtt ttgaccgggt cacgagtcat cggagctccg 660 
gagtattgtc acgcgaactt cactcagtat ttcacttccg ggttctggtc agacccggct 720 
ttaccgggtc taatctcggg tcaaaagcct tgctatttca acacaggagt gatggtgatg 780 
gatcttgtta gatggagaga agggaattac agagagaagt tagagcaatg gatgcaattg 840 
cagaagaaga tgagaatcta cgatcttgga tcattaccac cgtttctttt ggtgtttgcg 900 
ggtaatgttg aagctattga tcatagatgg aaccaacatg gtttaggagg agacaatata 960 
cgaggaagtt gtcggtcatt gcatcctggt cctgtgagct tgttgcattg gagtggtaaa 1020 
ggtaagccat gggttagact tgatgagaag aggccttgtc cgttggatca tctttgggag 1080 
ccatatgatt tgtataagca taagattgag agagctaaag atcagtctct gcttgggttt 1140 
gcttctctgt cggagttgac tgatgattca agcttcttgt ga 1182 

<210> 38 

<211> 393 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 38 

Met Ser Ser Arg Phe Ser Leu Thr Val Val Cys Leu Ile Ala Leu Leu 
1 5 10 15 

Pro Phe Val Val Gly Ile Arg Leu Ile Pro Ala Arg Ile Thr Ser Val 
20 25 30 

Gly Asp Gly Gly Gly Gly Gly Gly Asn Asn Gly Phe Ser Lys Leu Gly 
35 40 45 

Pro Phe Met Glu Ala Pro Glu Tyr Arg Asn Gly Lys Glu Cys val ser 
50 55 60 

ser ser Val Asn Arg Glu Asn Phe val ser ser ser ser ser ser Asn 
65 70 75 80 

Asp Pro Ser Leu Val His Ile Ala Met Thr Leu Asp Ser Glu Tyr Leu 
85 90 95 

Arg Gly Ser Ile Ala Ala Val His Ser Val Leu Arg His Ala Ser Cys 
100 105 110 

Pro Glu Asn Val Phe Phe His Phe Ile Ala Ala Glu Phe Asp Ser Ala 
115 120 125 

Ser Pro Arg Val Leu Ser Gln Leu Val Arg Ser Thr Phe Pro Ser Leu 
130 135 140 

Asn Phe Lys Val Tyr Ile Phe Arg Glu Asp Thr Val Ile Asn Leu Ile 
145 150 155 160 

ser ser Ser He Arg Leu Ala Leu Glu Asn Pro Leu Asn Tyr Ala Arg 
165 170 175 

Asn Tyr Leu Gly Asp He Leu Asp Arg Ser val Glu Arg Val He Tyr 
180 185 190 

Leu Asp ser Asp val He Thr val Asp Asp lie Thr Lys Leu Trp Asn 
195 200 205 

Thr val Leu Thr Gly Ser Arg val He Gly Ala Pro Glu Tyr cys His 
210 215 220 

Ala Asn Phe Thr Gln Tyr Phe Thr Ser Gly Phe Trp Ser Asp Pro Ala 
225 230 235 240 

Leu Pro Gly Leu He Ser Gly Gin Lys Pro Cys Tyr Phe Asn Thr Gly 
245 250 255 

Val Met Val Met Asp Leu Val Arg Trp Arg Glu Gly Asn Tyr Arg Glu 
260 265 270 

Lys Leu Glu Gln Trp Met Gln Leu Gln Lys Lys Met Arg Ile Tyr Asp 
275 280 285 

Leu Gly ser Leu Pro Pro Phe Leu Leu Val Phe Ala Gly Asn val Glu 
290 295 300 

Ala Ile Asp His Arg Trp Asn Gln His Gly Leu Gly Gly Asp Asn Ile 
305 310 315 320 

Arg Gly Ser Cys Arg Ser Leu His Pro Gly Pro Val Ser Leu Leu His 
325 330 335 

Trp Ser Gly Lys Gly Lys Pro Trp Val Arg Leu Asp Glu Lys Arg Pro 
340 345 350 

Cys Pro Leu Asp His Leu Trp Glu Pro Tyr Asp Leu Tyr Lys His Lys 
355 360 365 

Ile Glu Arg Ala Lys Asp Gln Ser Leu Leu Gly Phe Ala Ser Leu Ser 
370 375 380 

Glu Leu Thr Asp Asp ser ser Phe Leu 
385 390 

<210> 39 
<211> 1173 

<212> DNA 

<213> Arabidopsis thaliana 
<400> 39 

atgcggttgc gttttccgat gaaatctgcc gttttagcgt ttgctatctt tctggtgttt 60 
attccactgt tttccgtcgg tatacggatg attccgggaa gactcaccgc cgtatccgcc 120 
accgtcggaa atggctttga tctggggtcg ttcgtggaag ctccggagta cagaaacggc 180 
aaggagtgcg tgtctcaatc gttgaacaga gaaaacttcg tgtcgtcttg cgacgcttcg 240 
ttagttcatg tagctatgac gcttgactcg gagtacttac gtggctcaat cgcagccgta 300 
cattcaatgc tccgccacgc gtcgtgtcca gaaaacgtct tcttccatct catcgctgca 360 
gagtttgacc cggcgagtcc acgcgttctg agtcaactcg tccgatctac tttcccgtcg 420 
ctaaacttca aagtctacat tttccgggaa gatacggtga tcaaccttat ctcttcttca 480 
atcagacaag ctttagagaa tccattgaac tatgctcgga actacctcgg agatattctt 540 
gatccatgcg tagacagagt catttaccta gactcggaca tcatcgtcgt cgatgacata 600 
acaaagcttt ggaacacgag tttgacaggg tcaagaatca tcggagctcc ggagtattgt 660 
cacgctaact tcacaaagta cttcacttca ggtttctggt ccgacccggc tttacccggt 720 

ttcttctcgg gtcgaaagcc ttgttatttc aacacgggtg tgatggtgat ggatctagtt 780 

agatggagag aaggaaacta cagagaaaag cttgaaactt ggatgcagat acagaagaag 840 

aagagaatct acgatttggg ttctttgcct ccgtttcttc ttgtcttcgc agggaacgtt 900 

gaagcaattg atcataggtg gaaccaacat ggtttaggag gagacaatgt acgaggaagt 960 

tgtaggtctt tgcataaagg accagtgagt ttgttgcatt ggagtggtaa aggtaagcca 1020 

tgggtgagac ttgatgagaa gagaccgtgt ccgttggatc atttatggga accgtatgat 1080 

ttatatgagc ataagattga aagagctaaa gatcagtctt tgttcgggtt ctcttctttg 1140 

tctgagttaa cagaagattc aagctttttc tga 1173 

<210> 40 
<211> 390 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 40 

Met Arg Leu Arg Phe Pro Met Lys Ser Ala Val Leu Ala Phe Ala Ile 
1 5 10 15 

Phe Leu Val Phe lie Pro Leu Phe Ser val Gly He Arg Met He Pro 
20 25 30 

Gly Arg Leu Thr Ala val Ser Ala Thr val Gly Asn Gly Phe Asp Leu 
35 40 45 

Gly Ser Phe val Glu Ala Pro Glu Tyr Arg Asn Gly Lys Glu Cys val 
50 55 60 

ser Gin ser Leu Asn Arg Glu Asn Phe val ser Ser cys Asp Ala Ser 
65 70 75 80 

Leu Val His Val Ala Met Thr Leu Asp ser Glu Tyr Leu Arg Gly Ser 
85 90 95 

Ile Ala Ala Val His Ser Met Leu Arg His Ala Ser Cys Pro Glu Asn 
100 105 110 

val Phe Phe His Leu He Ala Ala Glu Phe Asp Pro Ala Ser Pro Arg 
115 120 125 

Val Leu Ser Gin Leu val Arg ser Thr Phe Pro Ser Leu Asn Phe Lys 
130 135 140 

Val Tyr lie Phe Arg Glu Asp Thr Val lie Asn Leu lie Ser Ser Ser 
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150 155 160 



lie Arq Gin Ala Leu G"lu Asn Pro Leu Asn Tyr Ala Arg Asn Tyr Leu 
165 170 175 

Gly Asp lie Leu Asp Pro Cys val Asp Arg Val He Tyr Leu Asp Ser 
180 185 190 

a so He lie Val Val Asp Asp lie Thr Lys Leu Trp Asn Thr Ser Leu 
195 200 205 

Thr Gly ser Arg lie lie Gly Ala Pro Glu Tyr cys His Ala Asn Phe 
210 215 220 

Thr Lys Tyr Phe Thr ser Gly Phe Trp Ser Asp Pro Ala Leu Pro Gly 
225 230 235 *w 

Phe Phe ser Gly Arg Lys Pro Cys Tyr Phe Asn Thr Gly val Met val 
245 250 Zjj 

Met Asp Leu val Arg Trp Arg Glu Gly Asn Tyr Arg Glu Lys Leu Glu 
260 265 270 

Thr Trp Met Gin lie Gin Lys Lys Lys Arg He Tyr Asp Leu Gly ser 
275 280 285 

Leu Pro Pro Phe Leu Leu Val Phe Ala Gly Asn val Glu Ala He Asp 
290 295 300 

His Arg Trp Asn Gin His Gly Leu Gly Gly Asp Asn val Arg Gly Ser 
305 310 315 a^u 

cys Arg ser Leu His Lys Gly Pro val Ser Leu Leu His Trp Ser Gly 
7 325 330 335 

Lys Gly Lys Pro Trp val Arg Leu Asp Glu Lys Arg Pro cys Pro Leu 
340 345 350 

Asp His Leu Trp Glu Pro Tyr Asp Leu Tyr Glu His Lys lie Glu Arg 
355 360 365 

Ala Lys Asp Gin ser Leu Phe Gly Phe Ser Ser Leu Ser Glu Leu Thr 
370 375 380 



Glu Asp Ser Ser Phe Phe 
385 390 
<210> 41
<211> 1056
<212> DNA
<213> Arabidopsis thaliana
<400> 41

atggcctXa ggagcctctc ctatacacaa ctcctaggcc tcctgtcctt tatactcctc 60 
ttggtcacaa ccaccactat ggcggttcgt gttggagtca ttcttcataa gccttctgct 120 
ccaactcttc ctgttttcag agaagccccg gcttttcgaa acggtgatca atgcgggact 180 
cgtgaggctg atcagattca tatcgccatg actctcgaca caaactacct ccgtggcaca 240 
atggctgccg ttttgtctct ccttcaacat tccacttgcc ctgaaaacct ctcttttcat 300
ttcctgtccc ttcctcattt cgaaaacgac cttttcacca gcatcaaatc aacctttcct 360
tacctaaact tcaagattta tcagtttgat ccaaacctcg tccgcagcaa gatatcgaaa 420
tccatcaggc aagcccttga tcagcctctt aactacgcaa gaatctacct cgcggatatc 480
atccctagca gcgttgacag gatcatctac ttagactcag acctcgttgt ggtagacgac 540
atagagaagc tgtggcatgt ggagatggaa ggtaaagtgg tggctgctcc cgagtactgc 600
cacgcaaact tcacccatta tttcacaaga actttctggt cagacccggt attggtcaaa 660
gttcttgaag gaaaacgtcc gtgttatttc aacacagggg tgatggttgt ggatgtaaac 720
aaatggagga aaggaatgta tacacagaag gtagaagagt ggatgacaat tcagaagcag 780
aagaggatat accatttggg atcattacct ccgtttctgc tgatattcgc cggtgatata 840
aaagcggtta atcataggtg gaaccagcat ggtctaggag gtgataattt cgaaggaaga 900
tgtagaacgt tgcatccggg accgataagt cttcttcact ggagtggaaa agggaagcca 960
tggttaagac tagattcaag gaagccttgt atcgttgatc atctatgggc accgtatgat 1020
ctgtaccgtt catcaagaca ttcattagaa gagtag 1056



<210> 42
<211> 351
<212> PRT
<213> Arabidopsis thaliana
<400> 42

Met Ala Ser Arg Ser Leu Ser Tyr Thr Gln Leu Leu Gly Leu Leu Ser
1 5 10 15

Phe Ile Leu Leu Leu Val Thr Thr Thr Thr Met Ala Val Arg Val Gly
20 25 30

Val Ile Leu His Lys Pro Ser Ala Pro Thr Leu Pro Val Phe Arg Glu
35 40 45

Ala Pro Ala Phe Arg Asn Gly Asp Gln Cys Gly Thr Arg Glu Ala Asp
50 55 60

Gln Ile His Ile Ala Met Thr Leu Asp Thr Asn Tyr Leu Arg Gly Thr
65 70 75 80

Met Ala Ala Val Leu Ser Leu Leu Gln His Ser Thr Cys Pro Glu Asn
85 90 95

Leu Ser Phe His Phe Leu Ser Leu Pro His Phe Glu Asn Asp Leu Phe
100 105 110

Thr Ser Ile Lys Ser Thr Phe Pro Tyr Leu Asn Phe Lys Ile Tyr Gln
115 120 125

Phe Asp Pro Asn Leu Val Arg Ser Lys Ile Ser Lys Ser Ile Arg Gln
130 135 140

Ala Leu Asp Gln Pro Leu Asn Tyr Ala Arg Ile Tyr Leu Ala Asp Ile
145 150 155 160

Ile Pro Ser Ser Val Asp Arg Ile Ile Tyr Leu Asp Ser Asp Leu Val
165 170 175

Val Val Asp Asp Ile Glu Lys Leu Trp His Val Glu Met Glu Gly Lys
180 185 190

Val Val Ala Ala Pro Glu Tyr Cys His Ala Asn Phe Thr His Tyr Phe
195 200 205

Thr Arg Thr Phe Trp Ser Asp Pro Val Leu Val Lys Val Leu Glu Gly
210 215 220

Lys Arg Pro Cys Tyr Phe Asn Thr Gly Val Met Val Val Asp Val Asn
225 230 235 240

Lys Trp Arg Lys Gly Met Tyr Thr Gln Lys Val Glu Glu Trp Met Thr
245 250 255

Ile Gln Lys Gln Lys Arg Ile Tyr His Leu Gly Ser Leu Pro Pro Phe
260 265 270

Leu Leu Ile Phe Ala Gly Asp Ile Lys Ala Val Asn His Arg Trp Asn
275 280 285

Gln His Gly Leu Gly Gly Asp Asn Phe Glu Gly Arg Cys Arg Thr Leu
290 295 300

His Pro Gly Pro Ile Ser Leu Leu His Trp Ser Gly Lys Gly Lys Pro
305 310 315 320

Trp Leu Arg Leu Asp Ser Arg Lys Pro Cys Ile Val Asp His Leu Trp
325 330 335

Ala Pro Tyr Asp Leu Tyr Arg Ser Ser Arg His Ser Leu Glu Glu
340 345 350

<210> 43
<211> 1098
<212> DNA
<213> Arabidopsis thaliana
<400> 43

atgatgtctg gttcaagatt agcctctaga ctaataataa tcttctcaat aatctccaca 60
tctttcttca ccgttgaatc gattcgacta ttccctgatt cattcgacga tgcatcttca 120
gatttaatgg aagctccagc atatcaaaac ggtcttgatt gctctgtttt agccaaaaac 180
agactcttgt tagcttgtga tccatcagct gttcatatag ctatgactct agatccagct 240
tacttgcgtg gcacggtatc tgcagtacat tccatcctca aacacacttc ttgccctgaa 300
aacatcttct tccacttcat tgcttcgggt acaagtcagg gttccctcgc caagacccta 360
tcctctgttt ttccttcttt gagtttcaaa gtctatacct ttgaagaaac cacggtcaag 420
aatctaatct cttcttctat aagacaagct cttgatagtc ctttgaatta cgcaagaagc 480
tacttatccg agattctttc ttcgtgtgtt agtcgagtga tttatctcga ttcggatgtg 540
attgtggtcg atgatattca gaaactatgg aagatttctt tatccgggtc aagaacaatc 600
ggtgcaccag agtattgcca cgcaaatttc accaaatact tcacagatag tttctggtcc 660
gatcaaaaac tctcgagtgt cttcgattcc aagactcctt gttatttcaa cacaggagtg 720
atggttatcg atttagagcg atggagagaa ggagattaca cgagaaagat cgaaaactgg 780
atgaagattc agaaagaaga taagagaatc tacgaattgg gttctttacc accgtttctt 840
ctagtgtttg gtggtgatat tgaagctatt gatcatcaat ggaaccaaca cggtctcggt 900
ggagacaaca ttgtgagtag ttgtagatct ttgcatcctg gtccggttag tttgatacat 960
tggagtggta aagggaagcc atgggttagg cttgatgatg gtaagccttg tccaattgat 1020
tatctttggg ctccttatga tcttcacaag tcacagaggc agtatcttca atacaatcaa 1080
gagttagaaa ttctttga 1098



<210> 44
<211> 365
<212> PRT
<213> Arabidopsis thaliana
<400> 44

Met Met Ser Gly Ser Arg Leu Ala Ser Arg Leu Ile Ile Ile Phe Ser
1 5 10 15

Ile Ile Ser Thr Ser Phe Phe Thr Val Glu Ser Ile Arg Leu Phe Pro
20 25 30

Asp Ser Phe Asp Asp Ala Ser Ser Asp Leu Met Glu Ala Pro Ala Tyr
35 40 45

Gln Asn Gly Leu Asp Cys Ser Val Leu Ala Lys Asn Arg Leu Leu Leu
50 55 60

Ala Cys Asp Pro Ser Ala Val His Ile Ala Met Thr Leu Asp Pro Ala
65 70 75 80

Tyr Leu Arg Gly Thr Val Ser Ala Val His Ser Ile Leu Lys His Thr
85 90 95

Ser Cys Pro Glu Asn Ile Phe Phe His Phe Ile Ala Ser Gly Thr Ser
100 105 110

Gln Gly Ser Leu Ala Lys Thr Leu Ser Ser Val Phe Pro Ser Leu Ser
115 120 125

Phe Lys Val Tyr Thr Phe Glu Glu Thr Thr Val Lys Asn Leu Ile Ser
130 135 140

Ser Ser Ile Arg Gln Ala Leu Asp Ser Pro Leu Asn Tyr Ala Arg Ser
145 150 155 160

Tyr Leu Ser Glu Ile Leu Ser Ser Cys Val Ser Arg Val Ile Tyr Leu
165 170 175

Asp Ser Asp Val Ile Val Val Asp Asp Ile Gln Lys Leu Trp Lys Ile
180 185 190

Ser Leu Ser Gly Ser Arg Thr Ile Gly Ala Pro Glu Tyr Cys His Ala
195 200 205

Asn Phe Thr Lys Tyr Phe Thr Asp Ser Phe Trp Ser Asp Gln Lys Leu
210 215 220

Ser Ser Val Phe Asp Ser Lys Thr Pro Cys Tyr Phe Asn Thr Gly Val
225 230 235 240

Met Val Ile Asp Leu Glu Arg Trp Arg Glu Gly Asp Tyr Thr Arg Lys
245 250 255

Ile Glu Asn Trp Met Lys Ile Gln Lys Glu Asp Lys Arg Ile Tyr Glu
260 265 270

Leu Gly Ser Leu Pro Pro Phe Leu Leu Val Phe Gly Gly Asp Ile Glu
275 280 285

Ala Ile Asp His Gln Trp Asn Gln His Gly Leu Gly Gly Asp Asn Ile
290 295 300

Val Ser Ser Cys Arg Ser Leu His Pro Gly Pro Val Ser Leu Ile His
305 310 315 320

Trp Ser Gly Lys Gly Lys Pro Trp Val Arg Leu Asp Asp Gly Lys Pro
325 330 335

Cys Pro Ile Asp Tyr Leu Trp Ala Pro Tyr Asp Leu His Lys Ser Gln
340 345 350

Arg Gln Tyr Leu Gln Tyr Asn Gln Glu Leu Glu Ile Leu
355 360 365

<210> 45
<211> 1026
<212> DNA
<213> Arabidopsis thaliana
<400> 45

atgcactcga agtttatatt atatctcagc atcctcgccg tattcaccgt ctctttcgcc 60
ggcggcgaga gattcaaaga agctccaaag ttcttcaact ccccggagtg tctaaccatc 120
gaaaacgatg aagatttcgt ttgttcagac aaagccatcc acgtggcaat gaccttagac 180
acagcttacc tccgtggctc aatggccgtg attctctccg tcctccaaca ctcttcttgt 240
cctcaaaaca ttgttttcca cttcgtcact tcaaaacaaa gccaccgact ccaaaactac 300
gtcgttgctt cttttcccta cttgaaattc cgaatttacc cttacgacgt agccgccatc 360
tccggcctca tctcaacctc catccgctcc gcgctagact ctccgctaaa ctacgcaaga 420
aactacctcg ccgacattct tcccacgtgc ctctcacgtg tcgtatacct agactcagat 480
ctcatactcg tcgatgacat ctccaagctc ttctccactc acatccctac cgacgtcgtt 540
ttagccgcgc ctgagtactg caacgcaaac ttcacgactt actttactcc gacgttttgg 600
tcaaaccctt ctctctccat cacactatcc ctcaaccgcc gtgctacacc gtgttacttc 660
aacaccggag tgatggtcat cgagttaaag aaatggcgag aaggagatta cacgaggaag 720
atcatagagt ggatggagtt acaaaaacgg ataagaatct acgagttagg ctctttacca 780
ccgtttttac ttgtcttcgc cggaaacata gctccggtag atcaccggtg gaaccaacac 840
ggtttaggag gagataattt tagaggactg tgtcgagatt tgcatccagg tccagtgagt 900
ttgttgcatt ggagtgggaa agggaagcca tgggtaaggt tagatgatgg tcgaccttgc 960
ccgcttgatg cactttgggt tccatatgat ttgttagagt cacggttcga ccttatcgag 1020
agttaa 1026

<210> 46
<211> 341
<212> PRT
<213> Arabidopsis thaliana
<400> 46

Met His Ser Lys Phe Ile Leu Tyr Leu Ser Ile Leu Ala Val Phe Thr
1 5 10 15

Val Ser Phe Ala Gly Gly Glu Arg Phe Lys Glu Ala Pro Lys Phe Phe
20 25 30

Asn Ser Pro Glu Cys Leu Thr Ile Glu Asn Asp Glu Asp Phe Val Cys
35 40 45

Ser Asp Lys Ala Ile His Val Ala Met Thr Leu Asp Thr Ala Tyr Leu
50 55 60

Arg Gly Ser Met Ala Val Ile Leu Ser Val Leu Gln His Ser Ser Cys
65 70 75 80

Pro Gln Asn Ile Val Phe His Phe Val Thr Ser Lys Gln Ser His Arg
85 90 95

Leu Gln Asn Tyr Val Val Ala Ser Phe Pro Tyr Leu Lys Phe Arg Ile
100 105 110

Tyr Pro Tyr Asp Val Ala Ala Ile Ser Gly Leu Ile Ser Thr Ser Ile
115 120 125

Arg Ser Ala Leu Asp Ser Pro Leu Asn Tyr Ala Arg Asn Tyr Leu Ala
130 135 140

Asp Ile Leu Pro Thr Cys Leu Ser Arg Val Val Tyr Leu Asp Ser Asp
145 150 155 160

Leu Ile Leu Val Asp Asp Ile Ser Lys Leu Phe Ser Thr His Ile Pro
165 170 175

Thr Asp Val Val Leu Ala Ala Pro Glu Tyr Cys Asn Ala Asn Phe Thr
180 185 190

Thr Tyr Phe Thr Pro Thr Phe Trp Ser Asn Pro Ser Leu Ser Ile Thr
195 200 205

Leu Ser Leu Asn Arg Arg Ala Thr Pro Cys Tyr Phe Asn Thr Gly Val
210 215 220

Met Val Ile Glu Leu Lys Lys Trp Arg Glu Gly Asp Tyr Thr Arg Lys
225 230 235 240

Ile Ile Glu Trp Met Glu Leu Gln Lys Arg Ile Arg Ile Tyr Glu Leu
245 250 255

Gly Ser Leu Pro Pro Phe Leu Leu Val Phe Ala Gly Asn Ile Ala Pro
260 265 270

Val Asp His Arg Trp Asn Gln His Gly Leu Gly Gly Asp Asn Phe Arg
275 280 285

Gly Leu Cys Arg Asp Leu His Pro Gly Pro Val Ser Leu Leu His Trp
290 295 300

Ser Gly Lys Gly Lys Pro Trp Val Arg Leu Asp Asp Gly Arg Pro Cys
305 310 315 320

Pro Leu Asp Ala Leu Trp Val Pro Tyr Asp Leu Leu Glu Ser Arg Phe
325 330 335

Asp Leu Ile Glu Ser
340

<210> 47
<211> 1086
<212> DNA
<213> Arabidopsis thaliana
<400> 47

atgctttgga tcatgagatt ctccggttta ttctccgccg ctttggttat catcgtcctc 60
tctccttctc tccaatcgtt tcctccagct gaagctatca gatcctctca tctcgacgct 120
tacctccgtt tcccctcctc cgatccaccg ccgcatagat tctccttcag aaaagctcct 180
gttttccgca atgccgccga ttgcgccgcc gcagatatcg attccggcgt ctgtaaccct 240
tccttggtcc acgtcgcgat tactctcgat ttcgagtacc tgcgtggctc aatcgccgcc 300
gttcattcga ttctcaagca ctcgtcgtgt cccgagagcg tcttcttcca tttcctcgtc 360
tccgagactg acctagaatc cttgattcgt tcgacttttc ccgaattgaa attaaaggtt 420
tactacttcg atccggagat tgtacggacg ctgatctcaa cctccgtgag acaagcgctc 480
gagcagccgt tgaattacgc tagaaattac ctagctgacc ttctcgagcc ttgcgtgcgt 540
cgcgtgatct acctagattc cgatctaatc gtcgtcgacg acatcgcaaa gctctggatg 600
acgaaactgg gatcgaaaac gatcggagct cccgagtact gtcacgcgaa cttcacaaag 660
tatttcacac cggcgttctg gtccgacgag aggttctccg gagctttctc cgggaggaaa 720
ccgtgctact tcaacacggg agtgatggtg atggatctag agagatggag gcgcgtaggg 780
tacacggagg tgatagagaa atggatggag attcagaaga gtgataggat ttacgagctg 840
ggatcattgc cgccgttctt gttggtgttc gccggagaag tagctccgat agagcatcgg 900
tggaaccagc atgggcttgg tggagataac gtgagaggaa gctgtagaga tttacatccc 960
ggtccggtta gcttgcttca ttggtccggt agtggtaaac cgtggtttcg gttagattcg 1020
agacggcctt gtccacttga tactctttgg gcaccttatg atttgtatgg acactactct 1080
cgctga 1086

<210> 48
<211> 361
<212> PRT
<213> Arabidopsis thaliana
<400> 48

Met Leu Trp Ile Met Arg Phe Ser Gly Leu Phe Ser Ala Ala Leu Val
1 5 10 15

Ile Ile Val Leu Ser Pro Ser Leu Gln Ser Phe Pro Pro Ala Glu Ala
20 25 30

Ile Arg Ser Ser His Leu Asp Ala Tyr Leu Arg Phe Pro Ser Ser Asp
35 40 45

Pro Pro Pro His Arg Phe Ser Phe Arg Lys Ala Pro Val Phe Arg Asn
50 55 60

Ala Ala Asp Cys Ala Ala Ala Asp Ile Asp Ser Gly Val Cys Asn Pro
65 70 75 80

Ser Leu Val His Val Ala Ile Thr Leu Asp Phe Glu Tyr Leu Arg Gly
85 90 95

Ser Ile Ala Ala Val His Ser Ile Leu Lys His Ser Ser Cys Pro Glu
100 105 110

Ser Val Phe Phe His Phe Leu Val Ser Glu Thr Asp Leu Glu Ser Leu
115 120 125

Ile Arg Ser Thr Phe Pro Glu Leu Lys Leu Lys Val Tyr Tyr Phe Asp
130 135 140

Pro Glu Ile Val Arg Thr Leu Ile Ser Thr Ser Val Arg Gln Ala Leu
145 150 155 160

Glu Gln Pro Leu Asn Tyr Ala Arg Asn Tyr Leu Ala Asp Leu Leu Glu
165 170 175

Pro Cys Val Arg Arg Val Ile Tyr Leu Asp Ser Asp Leu Ile Val Val
180 185 190

Asp Asp Ile Ala Lys Leu Trp Met Thr Lys Leu Gly Ser Lys Thr Ile
195 200 205

Gly Ala Pro Glu Tyr Cys His Ala Asn Phe Thr Lys Tyr Phe Thr Pro
210 215 220

Ala Phe Trp Ser Asp Glu Arg Phe Ser Gly Ala Phe Ser Gly Arg Lys
225 230 235 240

Pro Cys Tyr Phe Asn Thr Gly Val Met Val Met Asp Leu Glu Arg Trp
245 250 255

Arg Arg Val Gly Tyr Thr Glu Val Ile Glu Lys Trp Met Glu Ile Gln
260 265 270

Lys Ser Asp Arg Ile Tyr Glu Leu Gly Ser Leu Pro Pro Phe Leu Leu
275 280 285

Val Phe Ala Gly Glu Val Ala Pro Ile Glu His Arg Trp Asn Gln His
290 295 300

Gly Leu Gly Gly Asp Asn Val Arg Gly Ser Cys Arg Asp Leu His Pro
305 310 315 320

Gly Pro Val Ser Leu Leu His Trp Ser Gly Ser Gly Lys Pro Trp Phe
325 330 335

Arg Leu Asp Ser Arg Arg Pro Cys Pro Leu Asp Thr Leu Trp Ala Pro
340 345 350

Tyr Asp Leu Tyr Gly His Tyr Ser Arg
355 360

<210> 49
<211> 1041
<212> DNA
<213> Arabidopsis thaliana
<400> 49

atgctttgga taacgagatt tgctggatta ttctccgccg cgatggcagt gatcgtgtta 60
tctccgtcgc ttcagtcatt tcctccggcg gcggcaatcc gttcttctcc atcaccgatc 120
ttcagaaaag ctccagcggt gttcaacaac ggcgacgaat gtctctcctc cggcggcgtc 180
tgcaatccgt cgttggtcca cgtggcgatc acgttagacg tagagtacct gcgtggctca 240
atcgcagccg ttaactcgat ccttcagcac tcggtgtgtc cagagagcgt cttcttccac 300
ttcatcgccg tctccgagga aacaaacctg ttggagtcgc tggtgagatc ggttttcccg 360
agactgaaat tcaatattta cgattttgcc cctgagacag ttcgtggttt gatttcttct 420
tccgtgagac aagctctcga gcagcctctg aactacgcta gaagctactt agcggatctg 480
ctggagcctt gtgttaaccg tgtcatatac ttggattcgg atcttgtcgt cgtcgatgac 540
atcgctaagc tttggaaaac tagcctaggc tcgaggataa tcggagctcc ggagtattgt 600
cacgcgaatt tcacgaaata cttcaccgga ggattctggt cggaggagag attctccggt 660
acctttagag ggaggaagcc atgttacttc aacacaggtg tgatggtgat agatcttaag 720
aaatggagaa gaggtggtta cacgaaacgt atcgagaaat ggatggagat tcagagaaga 780
gagaggattt acgaactagg ctcgcttcca ccgtttcttc tagttttctc cggtcacgtg 840
gctcccatct ctcaccggtg gaaccagcat ggacttggtg gtgacaatgt tagaggtagc 900
tgtcgtgatt tgcatcctgg tcctgtgagt ttgctgcatt ggtctggtag tggcaagccc 960
tggataagac tcgattccaa acggccttgt cccttagacg cattatggac gccttacgac 1020
ttgtatcgac attcgcattg a 1041

<210> 50
<211> 346
<212> PRT
<213> Arabidopsis thaliana
<400> 50

Met Leu Trp Ile Thr Arg Phe Ala Gly Leu Phe Ser Ala Ala Met Ala
1 5 10 15

Val Ile Val Leu Ser Pro Ser Leu Gln Ser Phe Pro Pro Ala Ala Ala
20 25 30

Ile Arg Ser Ser Pro Ser Pro Ile Phe Arg Lys Ala Pro Ala Val Phe
35 40 45

Asn Asn Gly Asp Glu Cys Leu Ser Ser Gly Gly Val Cys Asn Pro Ser
50 55 60

Leu Val His Val Ala Ile Thr Leu Asp Val Glu Tyr Leu Arg Gly Ser
65 70 75 80

Ile Ala Ala Val Asn Ser Ile Leu Gln His Ser Val Cys Pro Glu Ser
85 90 95

Val Phe Phe His Phe Ile Ala Val Ser Glu Glu Thr Asn Leu Leu Glu
100 105 110

Ser Leu Val Arg Ser Val Phe Pro Arg Leu Lys Phe Asn Ile Tyr Asp
115 120 125

Phe Ala Pro Glu Thr Val Arg Gly Leu Ile Ser Ser Ser Val Arg Gln
130 135 140

Ala Leu Glu Gln Pro Leu Asn Tyr Ala Arg Ser Tyr Leu Ala Asp Leu
145 150 155 160

Leu Glu Pro Cys Val Asn Arg Val Ile Tyr Leu Asp Ser Asp Leu Val
165 170 175

Val Val Asp Asp Ile Ala Lys Leu Trp Lys Thr Ser Leu Gly Ser Arg
180 185 190

Ile Ile Gly Ala Pro Glu Tyr Cys His Ala Asn Phe Thr Lys Tyr Phe
195 200 205

Thr Gly Gly Phe Trp Ser Glu Glu Arg Phe Ser Gly Thr Phe Arg Gly
210 215 220

Arg Lys Pro Cys Tyr Phe Asn Thr Gly Val Met Val Ile Asp Leu Lys
225 230 235 240

Lys Trp Arg Arg Gly Gly Tyr Thr Lys Arg Ile Glu Lys Trp Met Glu
245 250 255

Ile Gln Arg Arg Glu Arg Ile Tyr Glu Leu Gly Ser Leu Pro Pro Phe
260 265 270

Leu Leu Val Phe Ser Gly His Val Ala Pro Ile Ser His Arg Trp Asn
275 280 285

Gln His Gly Leu Gly Gly Asp Asn Val Arg Gly Ser Cys Arg Asp Leu
290 295 300

His Pro Gly Pro Val Ser Leu Leu His Trp Ser Gly Ser Gly Lys Pro
305 310 315 320

Trp Ile Arg Leu Asp Ser Lys Arg Pro Cys Pro Leu Asp Ala Leu Trp
325 330 335

Thr Pro Tyr Asp Leu Tyr Arg His Ser His
340 345